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Abstract 

Representing defeasibility is an important issue in common sense reasoning. In reasoning 
about action and change, this issue becomes more difficult because domain and action re- 
lated defeasible information may conflict with general inertia rules. Furthermore, different 
types of defeasible information may also interfere with each other during the reasoning. In 
this paper, we develop a prioritized logic programming approach to handle defeasibilities 
in reasoning about action. In particular, we propose three action languages AT , AT 1 
and AT 2 which handle three types of defeasibilities in action domains named defeasible 
constraints, defeasible observations and actions with defeasible and abnormal effects re- 
spectively. Each language with a higher superscript can be viewed as an extension of the 
language with a lower superscript. These action languages inherit the simple syntax of A 
language but their semantics is developed in terms of transition systems where transition 
functions are defined based on prioritized logic programs. By illustrating various examples, 
we show that our approach eventually provides a powerful mechanism to handle various 
defeasibilities in temporal prediction and postdiction. We also investigate semantic proper- 
ties of these three action languages and characterize classes of action domains that present 
more desirable solutions in reasoning about action within the underlying action languages. 



1 Introduction 

Representing defeasibility is an important issue in common sense reasoning. In rea- 
soning about action, this issue becomes more difficult because domain and action 
related defeasible information may conflict with general inertia rules - that are 
necessary to specify things that persist with respect to actions and usually defeasi- 
ble as well. Furthermore, different types of defeasible information may also interfere 
with each other during the reasoning. Therefore, most previous action theories usu- 
ally ignored such defeasible information in problem domains. However, recent work 
on causality reveals that in many situations defeasibility plays an important role in 
temporal prediction and postdiction and ignoring this issue may cause difficulties 
in deriving correct solutions in reasoning about action. 



Let us consider the Switch-Power domain that was first addressed in (Zhang, 



1999), where two domain constraints were taken into account: 

if the switch is on, then the light is usually on; (1) 
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if there is no power, then the light is not on. (2) 

Intuitively, the first constraint is defeasible from our common sense. For instance , 
even if the switch is on, the light might not be on if there is no power, or there is a 
problem in the circuit, and so on. But if this constraint is not expressed as a defea- 
sible rule, we may have a difficulty in our reasoning. Suppose we simply represent 
the above two constraints as logical implications Switch D On and -^Power D ~^On 
respectively. If the initial state is {On, Power, Switch} and the robot is asked to 
perform an action Cut-Power with effect -^Power (e.g. a fire alarm leads the robot 
to perform this action). Clearly, Cut-Power will cause a direct effect ->Power, and 
then from constraint Switch D On and ->Power D ->On, an indirect effect -^Switch 
is derived. Obviously, this effect is not quite reasonable from our intuition as cutting 
off the power should be irrelevant to the switch's position. 

People may argue that the above problem is due to the duality of logical impli- 
cation (i.e. A D B = -^B D -iA). Now suppose we adopt McCain and Turn's causal 
theory ( McCain fc Turner, 1995| ) where constraints (jl]) and (|2|) are represented as 



inference rules Switch =>■ On and -^Power =>■ —>On respectively]^] Then under the 
same initial state as above, it turns out that action Cut-Power becomes unexe- 
cutable because the effect ->Power together with rule ->Power =>■ ->On contradicts 
fact On which is derivable from fact Switch and rule Switch =>■ On. This is not a 
desirable solution either. 

The above example just illustrates one type of defeasibility - defeasible con- 
straints, which causes difficulties in reasoning about action. In fact, there are other 
types of defeasible information, such as defeasible observations and actions with 
defeasible and abnormal effects, that also significantly influence temporal predic- 
tion and postdiction. Although the problem of defeasibilities has been investigated 



by some researchers recently, e.g. ( Baral fc Lobo, 1997 ; Gcffncr, 1997 ; lablonowski 



et ai, 199q ; |Zhang, 1999|) , none of the previous proposals is completely satisfactory 
in terms of representing and handling different types of defeasibilities in temporal 
reasoning (we will discuss this issue in section 7). 

In this paper, we address three basic types of defeasible information related to 
temporal prediction and postdiction where incomplete information is allowable: 
defeasible constraints, defeasible observations and actions with defeasible and ab- 
normal effects. Our goal is to handle these three types of defeasibilities in reasoning 
about action under a unified framework of logic programming. 

The issue of representing action in logic programming languages is not new. It was 



explored by some researchers previously, e.g. (Eshghi fc Kowalski, 1989]). However, 



probably Gelfond and Lifschitz's work ( Gelfond fc Lifschitz, 1995| ) was the first 



time to make a major progress in this direction. By introducing a simple action 
language A, Gelfond and Lifschitz's action formulation was able to deal with both 
temporal prediction and postdiction, while properties of actions were characterized 
by translating action language A into the language of extended logic programs 



1 Informally, A = » R represents a semant ics like "if A then B" , from which we cannot derive 
=> -iA. See (McCain & Turner, 1995) for detail. 
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(Gelfond & Lifschitz, 1991). In other words, in Gelfond and Lifschitz's formulation, 
extended logic program was used as an implementation of the high level action 
language A. 

It has been recognized that logic programming can not only be used as the 
implementation of a high level action language, but also can be used as a basis for 
providing a formal semantics of the high level language, e.g. (Baral & Lobo, 1997). 
In this paper, we further demonstrate that prioritized logic programming has a 
great flexibility to serve as a semantic basis to develop high level action languages 
that handle various information conflicts in reasoning about action. The paper 
is organized as follows. Section 2 briefly reviews the concept of prioritized logic 
programs. Section 3 proposes a simple action language AT which can represent 
actions in domains with defeasible constraints. The syntax of AT is similar to that 
of A style action languages. A transition system is proposed to provide a formal 
semantics of AT , where a corresponding prioritized logic program is employed as a 
basis for defining such a transition system. Section 4 then extends action language 
AT to AT 1 so that it can represent defeasible observations and shows how it 
handles the problem of temporal postdiction under the occurrence of defeasible 
observations. Section 5 further generalizes AT 1 to action language AT 2 to represent 
actions with defeasible and abnormal effects. Section 6 then investigates various 
properties of action languages AT , AT 1 and AT 2 and characterize specific classes 
of action domains that may present desirable solutions in reasoning about action. 
Section 7 discusses related work, and finally section 8 concludes the paper with 
some remarks. 



2 Prioritized Logic Programs (PLPs): An Overview 

We first introduce the extended logic program and its answer set semantics de- 



veloped by Gelfond and Lifschitz ( Gelfond fc Lifschitz, 1991 ). A language C of 



extended logic programs is determined by its object constants, function constants 
and predicate constants. Terms are built as in the corresponding first order lan- 
guage; atoms have the form P{t\, • ■ • , t n ), where ti (1 < i < n) is a term and P is a 
predicate constant of arity n; a literal is either an atom P(t±, ■ • ■ , t n ) or a negative 
atom ->P(ii, • ■ • , t n )- A rule is an expression of the form: 

Lq *— Li, ■ ■ ■ , L m , notL m+ i, • • • , notL n , (3) 

where each T (0 < i < n) is a literal. Lq is called the head of the rule, while 
{Li, • • ■ , L m , notL m+ i, ■ ■ •, notL n } is called the body of the rule. Obviously, the 
body of a rule could be empty. We also allow the head of a rule to be empty. In this 
case, the rule with an empty head is called constraint. A term, atom, literal, or rule 
is ground if no variable occurs in it. An extended logic program II is a collection of 
rules. II is ground if each rule in II is ground. 

To evaluate an extended logic program, Gelfond and Lifschitz proposed an an- 
swer set semantics for extended logic programs. Let II be a ground extended logic 
program not containing not and Lit the set of all ground literals in the language 
of II. An answer set of II is the smallest subset S of Lit such that (i) for any rule 



4 



Y. Zhang 



Lq «— £>!,•••, L m from II, if L\, ■ ■ ■ , L m € S, then Lq 6 S; and (ii) if S contains 
a pair of complementary literals, then S = Lit. Now let II be a ground arbitrary 
extended logic program. For any subset S of Lit, let IP be the logic program ob- 
tained from II by deleting (i) each rule that has a formula not L in its body with 
L G S, and (ii) all formulas of the form not L in the bodies of the remaining rules^]. 
We define that S is an answer set of II iff S is an answer set of II s . 

For a non-ground extended logic program II, we usually view a rule in II con- 
taining variables to be the set of all ground instances of this rule formed from the 
set of ground literals in the language. The collection of all these ground rules forms 
the ground instantiation II' of II. Then a set of ground literals is an answer set of 
II if and only if it is an answer set of II'. It is easy to see that an extended logic 
program may have one, more than one, or no answer set at all. 

A prioritized logic program (PLP) V is a triple (II, TV, <) , where IT is an extended 
logic program, M is a naming function mapping each rule in II to a name, and < is 
a strict partial ordering on names. The partial ordering < in V plays an essential 
role in the evaluation of V. We also use V(<) to denote the set of Orelations of 
V . Intuitively < represents a preference of applying rules during the evaluation of 
the program. In particular, if N{r) < A/"(r') holds in V, rule r would be preferred 
to apply over rule r' during the evaluation of V (i.e. rule r is more preferred than 
rule r 1 ). Consider the following classical example represented in our formalism: 

Vx = <JL,N,<): 

Ni : Fly(x) <- Btrd(x), not ->Fly(x), 
N2 : ->Fly(x) *— Penguin(x), not Fly(x), 
N 3 : Bird(Tweety) <—, 
Ni : Penguin (Tweety) «— , 
N 2 < JVi. 

Obviously, rules Ni and N2 conflict with each other as their heads are complemen- 
tary literals, and applying Ni will defeat AT 2 and vice versa. However, as N2 < N\, 
we would expect that rule N% is preferred to apply first and then defeat rule N± so 
that the desired solution -> Fly (Tweety) can be derived. 

Definition 1 

Let II be a ground extended logic program and r a ground rule of the form (^) (r 
does not necessarily belong to II). Rule r is defeated by II iff II has an answer set 
and for any answer set S of II, there exists some Li £ S, where m + 1 < i < n. 

Now our idea of evaluating a PLP is as follows. Let V — (IT, N, <). If there are 
two rules r and r' in IT and Af(r) < Af(r'), r' will be ignored in the evaluation of 
V, only if keeping r in IT and deleting r' from IT will result in a defeat of r' . By 
eliminating all such potential rules from IT, V is eventually reduced to an extended 
logic program in which the partial ordering < has been removed. Our evaluation 
for V is then based on this reduced extended logic program. 

Similarly to the case of extended logic programs, the evaluation of a PLP will 
be based on its ground form. We say that a PLP V' = (IT', A/"', <') is the ground 

2 We also call II s the Gelfond-Lifschitz transformation of El in terms of S. 
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instantiation of V = (II, TV, <) if (1) II' is the ground instantiation of II; and (2) 
TV'(r() <' N'irt,) e V'(<') if and only if there exist rules r\ and r 2 in II such that r[ 
and r 2 are ground instances of r\ and r 2 respectively and TV(ri) < TV(r 2 ) € V(<). 
Under this definition, however, we require a restriction on a PLP since not every 
PLP's ground instantiation presents a consistent information with respect to the 
original PLP. Consider a PLP as follows: 

Ni : P(f(x)) <- notP(x), 

N 2 : P(f(f(x))) - notP(f(x)), 

N 2 < Ni. 

If the only constant in the language is 0, then the set of ground instances of N\ and 
N2 includes rules like: 

Ni : P(/(0)) <- notP(0), 
m : P(.f(f(0))) «- notP(f(0)), 
Ni : P(/(/(/(0)))) <- notP(f(f(0))), 

"3 

It is easy to see that A^' can be viewed as an instance for both Ni and TV 2 . Therefore, 
the ordering <' among rules N[, N^, A/3, ■ ■ ■ is no longer a strict partial ordering 
because of <' A^'. Obviously, we need to exclude this kind of programs in our 
context. On the other hand, we also want to avoid a situation like • • • <' A^' <' 
N2 <' N[ in the ground prioritized logic program because this <' indicates that 
there is no most preferred rule in the program. 

Given a PLP V = (II, TV, <). We say that V is well formed if there is no rule r' 
that is an instance of two different rules r± and r 2 in II and Af(r\) < N{r%) £ V{<). 
Then it is not difficult to observe that the following fact holds. 

Fact: If a PLP V = (II, TV, <) is well formed, then in its ground instantiation V' = 
(n'jjV', <'), <' is a partial ordering and every non-empty subset of II' has a least element 
with respect to <'. 

Due to the above fact, in the rest of this paper, we will only consider well formed 
PLP programs in our discussions, and consequently, the evaluation for an arbitrary 
PLP V = (n,JV,<) will be based on its ground instantiation V = (W,J\f ',<'). 
Therefore, in our context a ground prioritized (or extended) logic program may 
contain infinite number of rules. In this case, we will assume that this ground pro- 
gram is the ground instantiation of some program that only contains finite number 
of rules. In the rest of the paper, whenever there is no confusion, we will only 
consider ground prioritized (extended) logic programs without explicit declaration. 

Definition 2 



( Zhang fc Foo, 1997a ) Let V — (II, TV, <) be a prioritized logic program. V < is 
a reduct of V with respect to < if and only if there exists a sequence of sets IT 
(i = 0, 1, • • •) such that: 

1. n = n ; 

2. Hi = Hi-i — {r%, T2, ■ ■ ■ I (a) there exists r e IL_i such that 

for every j (j = 1, 2, • • ■), TV(r) < M{r 3 ) e V{<) and 

ri, r 2 , • ■ • are defeated by IL_i — {r*i, ra, • • •}, and (b) there 
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are no rules r', r", ■ ■ ■ 6 EEf—i such that N(rj) < N(r'), 
N(rj) < N(r"), • • • for some j (j = 1, 2, • • •) and r', r", • • • 
are defeated by IL_i — {r', r", • • •}}; 

3.p < =n~o n i- 

In Definition 2, "P*- is an extended logic program obtained from II by eliminating 
some rules from II. In particular, if J\f(r) < J\f(ri), Af(r) < J\f(r2), ■ ■ ■, and IIj_i — 
{fi, T2, ■ ■ •} defeats {r±, T2, ■ • •}, then rules r\, r^, • • • will be eliminated from IIj_i if 
no -/ess preferred rule can be eliminated (i.e. conditions (a) and (b)). This procedure 
is continued until a fixed point is reached. It should be noted that condition (b) in 
the above definition is necessary because without it some unintuitive results may be 
derived. For instance, consider V\ again, if we add additional preference N3 < N2 
in Vi, then using a modified version of Definition 2 without condition (b), 

{Fly(Tweety) <- Bird(Tweety),not-<Fly(Tweety), 
Bird(Tweety) <— , 
Penguin(Tweety) ^} 

is a reduct of V\, from which we will conclude that Tweety can fly. 

Theorem 1 

Every PLP has a reduct. 
Definition 3 



( ^hang fc Foo, 1997a| ) Let V = (II, TV, <) be a PLP and Lit the set of all ground 
literals in the language of V . For any subset S of Lit, S is an answer set of V iff S 
is an answer set for some reduct V < of V. 

Example 1 

Using Definitions 2 and 3, it is easy to conclude that V\ has a unique reduct as 
follows: 

V\ = Fly (Tweety) *— Penguin (Tweety) , not Fly(Tweety), 
Bird(Tweety) «— , 
Pengmn(Tweety) <— }, 

from which we obtain the following answer set of V\\ 

S = {Bird(Tweety), Penguin (Tweety) , ->Fly (Tweety)}. 

Now we consider another program V2 '■ 

Ni : A <-, 

N 2 : B <- not C, 

A^s : D <-, 

Ni : C <- not 5, 

ATi < iV 2l iV3 < iV4. 

According to Definition 2, it is easy to see that V2 has two reducts: 

{A <-, D <-, C <- not B}, and 
{A <—, B <- not C, D <-}. 

From Definition 3, it follows that V2 has two answer sets: {^4, C, D} and {^4, B, D}. 
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3 AT : Representing Actions in Domains with Defeasible Constraints 

In this section, we develop an action language AT which is able to handle domains 
with defeasible constraints. The syntax of language AT is inspired by A family 
languages, and a transition system will be developed to provide the semantics of 
AT where a corresponding prioritized logic program is employed to define the 
transition function. 



3. 1 Syntax of AT" 

The language AT has two disjoint sets of names called actions and fluents. We 
will use A, Ai, A 2 , ■ • ■ to denote action names , and F, F±, F 2 , • • • to denote fluent 
names. We define a fluent expression to be a fluent name possibly preceded by a 
negation sign -i. 

A vaiue proposition is an expression of the form: 

L after A lr --,A U (4) 

where L is a fluent expression and A±, • ■ ■ , Ai are action names. A value proposition 
is also called an initial proposition if no action name occurs in it: 

initially L ■ (5) 

A causal proposition is an expression of the form: 

L is caused if Li, • • • , L m with absence L m+ i, • • • , L n , (6) 

where L, L\, ■ ■ ■ , L n are fluent expressions. This is so-called defeasible constraint 
whose intuitive meaning is that L is caused to be true if L\ , ■ ■ ■ , L m are true and 
L m +i, ■ ■ ■ ,L n are not present. As a special case, @ is reduced to a non-defeasible 
causal rule if no absent fluent expression is mentioned: 

L is caused if L%, ■ ■ ■ , L m (7) 

An action effect proposition is an expression of the form: 

A causes L if L\, ■ ■ ■ , L^, (8) 

where A is an action name and Li, ■ ■ ■ , are fluent expressions. (J^j means that if 
preconditions L\, ■ ■ ■ , of A are true, then action A causes L to be true. Note the 
difference between (^) and (^) while no action is involved in the former. 

Now we define a domain description V of AT to be a finite set of initial propo- 
sitions, causal propositions and action effect propositions. It should be noted that 
here we do not include value propositions of the form (Q) into a domain descrip- 
tion since at the moment we restrict our formulation only to deal with prediction 
reasoning while a value proposition is only used as a query statement in the 
language^. The following example shows how language AT is used to describe an 
action domain. 



3 This restriction will be released in language AT 1 . 
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Example 2 

Let us consider the Switch-Power domain mentioned in section 1 again. The domain 
includes two constraints: (a) if the switch is on, then the light is usually on; (b) 
if there is no power, then the light is not on. We treat the first constraint as a 
defeasible causal rule. We also suppose that initially the light is on, there is power 
and the switch is on. An action Cut-Power is then performed. It has been shown 
that the previous approaches have difficulties to deal with this example due to a 
lack of expressibility of defeasible constraints ( [Zhang, 1999 ). This action scenario 



can be described by specifying a domain description T>(Switch- Power) of AT as 
follows. Firstly, T>(Switch- Power) contains the following three initial propositions: 

initially On, 
initially Power, 
initially Swttch. 

V( Switch- Power) also includes the following two causal propositions to capture the 
domain constraints presented above: 

On is caused if Swttch with absence -iOn, 
-^On is caused if -^Power. 

Finally, V(Switch- Power) has one action effect proposition: 
Cut-Power causes -^Power. 



3.2 Semantics of AT 

Similarly to the idea presented in ( Gelfond fc Lifschitz, 1993|) , we will define a 
transition system to provide a formal semantics for AT . However, instead of de- 
veloping an independent transition system for the language, our transition function 
is defined based on the PLP. This is because the PLP has a powerful mechanism 
of solving conflicts between defeasible information, which, from our observation, is 
difficult to handle in the traditional transition system approach. 



3.2.1 Translating AT into PLP 

We first propose a translation from a domain description V of AT into a PLP, and 
our transition function will be defined based on this translated PLP. To implement 
this translation, we consider a language C F Ar a of PLPs including the following 
vocabulary: 

- Situation sort: one situation constant So, and situation variables s, s±, S2, • • ■■ 

- Action sort: action constants A, A\, Ai, • ■ ■, and action variables a, a\, ai, ■ ■ ■. 

- Propositions! fluent sort: fluent constants F , F\, F2, • ■ ■, and fluent variables 
U1J2, ■ ■ ■■ 

- Function symbol: a binary function symbol Result which takes arguments of 
action and situation respectively, and returns a situation. 

- Predicate symbols: five binary predicate symbols Holds, Caused + , Causeal~ , 
Effect + and Effect~ , all of which take arguments of fluent and situation re- 
spectively. 
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In £^q-o situation term Result(a, s) indicates the resulting situation after per- 
forming action a in s. Atom Holds(f , s) (or literal ->Holds(f , s)) indicates the 
fact that fluent / is true (or false, resp.) in situation s. Atom Caused + (f,s) (or 
Caused~ (/ , s) resp.) indicates that fluent / is caused to be true (or false, resp.) in 
situation s. Causal rules in C-^j-q have the following forms^| 



Caused + (F , s) 

Caused" (F , s) 

Holds (f,s) <- 
^Holds(J,s) < 



[-^]Holds{F 1 , s), ■ ■ ■ , [^} Holds (F m ,s), 

not [-1] Holds (F m+ i, s), not [->] Holds (F n , s), (9) 

HiToZdsCFx, s), • • • , HiToZds(.F m , s), 

not [-i]_ffo/c?s(i 7 ' m +i, s), • • • , no< [-1] Holds (F n , s), (10) 



Caused + (/ , s), 
Caused~ (/ , s) 



(11) 
(12) 



Basically, rule ([)]) together with rule ([TT|) (or (|To|) together with ( |l2| ) resp.) says 
that if literals [-i]ffoWs(Fi, s), • • •, [-iJ-ffoZds^m, s) are true, and there is no explicit 
statement saying that [-i]/?oMs(F m+ i,s), • • •, [->] Holds (F n , s) are true, then fluent F 
is caused to be true (or false resp.) in situation s. As it will be seen, in our following 
translation, rules (|J) and (|l(]) are actually related to the domain description T> and 
hence are domain specific, while rules ( |lT| ) and ( |l2| ) act as generic rule schemas that 
are irrelevant to T> and hence are domain independent. For simplicity, we denote 



fl 



ind 



{©,©}• 



Atoms Effect + (f, s) and Effect (/, s) are used to represent direct effects of ac- 
tions. Generally, action effect rules have the following forms: 



Effect+(F,Result(A,s)) 



]Holds(F!,s),' 



]Holds(F k ,s), 



(13) 



Effect" (F,Result(A,s)) <- [-.]ifoJds(Fi, s), • • • , [-*]Holds(F k , s), (14) 
Holds(f,s) «- Effect+(f,s), (15) 

-,Holds(f,s) «- Effect- {f, a)- (16) 

Intuitively, rule (|l3|) together with rule ( |l5| ) (or ([[J]) together with ( |l6| ) resp.) 
says that if action yl's preconditions [-i]-ffoZcfe(Fi, s), • • •, [-i]ifoZds(Ffc, s) are true, 
then fluent F becomes true (or false resp.) after performing action A. Again, rules 
( |l3| ) and (|l4|) are domain specific, while rules (|l5|) and ( p^ ) represent domain inde- 
pendent schemas. Similarly, we denote 

n& = {©,©}• 



4 Notation [-1] means that the negation sign -1 may or may not occur. 
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Definition 4 

A PLP is called a translation of domain description V of ^4T°, denoted by (V) = 
(II, A/", <), if it is obtained as follows: 

1. II consists of the following rules: 

Initial fact rules : For each initial proposition (H) in T>, there is a rule of the 
form [-]Holds(F,So) <-fl 

Causal rules : For each causal proposition (0) in T>, there is a causal rule of 
the form (0) or (|l0|). Two domain independent causal rules (|ll| ) and (|l2| ) arc 
also included in this set. 

Action effect rules : For each action effect proposition (||), there is an action 
effect rule of the form (Oh or (|l4|). Two domain independent action effect 



rules (15) and (|16|) are also included in this set. 



Inertia rutej : 

Holds(f , Result(a, s)) <— Holds(f , s), not H olds {f , Result (a, s)){17) 



-i H olds (f , Result (a, s)) *— — 'Holds (f , s), not Holds(f , Result{a 1 sjjj.8) 

2. Naming function A/" assigns a unique name to each rule in II. 

3. For each causal rule A c and each inertia rule AT,, < -relation A c < A, holds. 

In ■p- 4r °(2?) specified above, II represents initial facts, domain constraints (causal 
rules) and action effects corresponding to V, and inertia rules are used to capture 
things that do not change with respect to actions. Since we allow to represent 
defeasible causal rules while inertia rules are also defeasible, possible conflicts may 
occur between these two types of rules. To solve such conflicts, we specify that a 
causal rule is more preferred than an inertia rule. The intuition behind this is clear: 
generally causal rules are used to derive indirect effects of actions, and whenever 
there is no explicit condition to block a defeasible causal rule, this rule should be 
triggered to derive necessary indirect effects. This point is illustrated in Example 3 
next. 

It is also obvious that to translate a specific domain description V into V Ar (T>) , 
we only need to translate domain specific information such as initial propositions, 
causal and action effect propositions into logic program rules, while other domain 
independent schema rules such as H| nd , H^d and rules ( |l7| ) and ( fL8|) arc automati- 
cally embedded in every translated PLP. Formally, in a given V AT (V) , we denote 
a set of domain specific rules as n^ ec , and specify the set of domain independent 
rules as 

n° nd = n? nd unf d ui4 d , (19) 

where W md ={©,©}• 

5 Here Holds(F, So) or ^Holds(F, So) is corresponding to whether L occurring in initially L is 
F or —iF respectively. This assumption is also adopted in the rest of this paper. 

6 Note that these two inertia rules actually represent a set of inertia rules by substituting fluent 
and action variables / and a with every fluent and action constants occurring in the domain 
respectively. 
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Example 3 

Example 2 continued. According to Definition 4, the domain description T>(Switch- Power) 
presented in Example 2 can be translated into a PLP, denoted by V AT (Switch-Power) 
= (Hspec U Af, <), where H° spec consists of the following rules: 
Initial fact rules: 



Holds(On, Sq) <— , 
Holds (Power , Sq) 
Holds(Switch, Sq) 



N 2 
N 3 

Causal rules : 

N4 : Caused + (On, s) <— Holds (Switch, s), not ->Holds(On, s), 
N5 : Caused' (On, s) <— -^Holds (Power , s), 
Action effect rule : 

Ne : Effect' (Power, Result(Cut- Power , s)) *—. 
Naming rules in n° nd : 

Assigning a unique name to each rule in n° nd . That is, we assign names N?, Ng, 
N 9 ,N 10 ,N U and N 12 to rules @, ©, (£§) ©, @ and © respectively. 
< -relations : 

N c < Ni, while N c and Ni are names of any causal rule and inertia rule in II 
respectively. That is, we have {N 4 , N 5 , N 7 , N$} < {Nn,N 12 }f\. 



3.2.2 Transition junction, models and entailment 

To define the transition function, we first introduce the concept of state. A state is 
a collection of fluent expressions. A state is consistent if it does not contain a fluent 
F and its negative correspondent -iF. We use symbols So, Si, S 2 , ■■■ to denote 
states. Then transition function 1Z maps a state to a power set of states by some 
action. 

Definition 5 

Given a domain description T> and its translation V Ar (T>), let A be the set of 
all answer sets of V Ar (T>). The transition function 1Z(A, S) of V with respect to 
action A and state S is defined as follows: 

1. If A is empty or includes an inconsistent answer set of V Ar0 (V), then 1Z(A, S) 
is undefined; 

2. S = {HF I [-^]Holds(F, S ) G Ans}, where Ans £ A; 

3. K(A, S) = {{HF I HHolds(F, Result (A, S')) € Ans, and for any F' 

HF' e S iff [-]Holda(F', S') s Ans} \ Ans G A}. 

It should be noted that we define a state to be a collection of fluent expressions, 
that is very different from the state defined in standard ,4-style action languages 
where states correspond to possible physical worlds and every fluent is either true or 



false in a state (Gelfond & Lifschitz, 1993). In our context, a state may not present 



7 This is an abbreviation of a set of <-relations of the form JVj < Nj, where i = 4,5,7,8 and 
j = 11, 12. 
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a complete information for fluents. If a fluent is not present in a state, then this 
fluent's truth value is viewed as unknown. Defining states in this way will bring us 
a flexibility to develop a formal semantics for our action theories where incomplete 
information related to defeasibility is admitted. 

In Definition 5, So is called the initial state of V, and 1Z(A, S) represents the 
set of all possible states resulting from the execution of A on state S. From the 
feature of V Ar (£>), it is quite obvious that the initial state So is always unique. 
On the other hand, TZ(A, S) may include more than one state. To see how transition 
function 1Z works, we consider a domain V consisting of the following propositions: 

initially F\, 

F2 is caused if -1F1 with absence -1F2, 
-1F2 is caused if -1F1 with absence F2, 
A\ causes ->F\, 
A2 causes -1.F2 if -*Fi, F2. 

Since two causal propositions conflict with each other and action A\ is executable 
in the initial situation, it is not difficult to see that 2?'s PLP translation V Ar (£>) 
has two different answer sets such that Holds (F 2 , Result (Ai, So)) is in one and 
-iHolds(F2, Result{A\ 1 So)) is in the other. Then from Definition 5, state transitions 
of V specified by transition function 1Z can be described by the following diagram, 
where {Fi} is the initial state: 




Fig. 1. State transitions. 
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Let A denote an action string A\ - • • A\ (as a special case, an empty action string 
is denoted as e). A structure \P is a partial function from strings of actions to states 
whose domain is prefix closed. We refer 9(e) = So, i.e. the initial state of T>. The 
following definition describes possible trajectories of the dynamic system (domain 
description) defined in AT under structure 9. 

Definition 6 

Given a structure 9. 

1. An initial proposition of the form (0) is satisfied in \& if i £ \I/(e); 

2. A causal proposition of the form (|6|) or an action effect proposition of the 
form (||) is satisfied in \I> if the following conditions hold: 

- for any action string A and action constant A, if 9(A) and 1Z(A,9(A)) 
are defined, then 9(1 ■ A) e K(A, 9(A)); 

- otherwise 9 (A • A) is undefined. 

A fluent expression L is true in a state 9(A) if L 6 9(A). 



Definition 7 

Given a domain description T>, a structure 9 is a model of T> if all initial, causal 
and action effect propositions in T> are satisfied in 9, and for any action string A 
and fluent F, F and ->F are not both true in ^'(^4). We say a value proposition of 
the form (g): L after A±,---,Ai is satisfied in * if I e *(^4), where A — A x ■ ■ ■ A t . 
We say V entails value proposition (^), denoted as V ^ after A\, ■ ■ ■ , Ai, if 

(|]) is satisfied in all models of V. 

Example 4 

Example 3 continued. From T>(Switch- Power'Ys PLP translation 

(Switch- Power) as shown in Example 3, it can be verified that V AT (Switch- Power) 
has a unique answer set that includes the following ground literals^} 

Holds(On,S ), 
Holds(Power, So), 
Holds(Switch, So), 

-^Holds(Power, Result(Cut- Power , So)), 
-^Holds(On, Result(Cut- Power , So)) and 
Holds(Switch, Result(Cut- Power , So)). 

Since So = {On, Power, Switch}, we have TL(Cut-Power, Sq) = {Si}, where 
S% = {^On,^Power, Switch}. Now it is easy to see that structure 9 is a model 
of V(Switch- Power), where 9(e) = So and 9(Cut-Power) = S\. Furthermore, 
according to Definition 7, we have 

V(Switch- Power) \=at° ^Power after Cut-Power, 
T>(Switch- Power) (=^7-0 ->On after Cut-Power, 
V(Switch-Power) \=at° Switch after Cut-Power. 



Obviously the answer set also includes many other ground literals that we are not interested in 
listing here. 
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Now we slightly modify the domain of Switch-Power as stated in Example 2. Sup- 
pose initially the light is not on and the switch is off, and another action Turn- On 
is also available. Then the modified domain description V '(Switch- Power') includes 
the following initial propositions: 

initially -<On, 
initially -^Switch, 

and the action effect proposition 
Turn- On causes Switch, 

together with the effect proposition of action Cut-Power and two causal proposi- 
tions as given in Example 2. Ignoring the detail, we can derive the following results: 

T)(Switch-Power') \=at° On after Turn-On, 
D(Switch-Power') \=at° ^On after Turn-On, Cut-Power, 
V(Switch-Power') \=at° Switch after Turn-On, Cut-Power. 



4 AT 1 : Combining Defeasible Observations into Action Domains 

We have shown that language AT handles temporal prediction where defeasible 
constraints are admitted. It, however, cannot deal with temporal postdiction, e.g. 
within the framework of AT we cannot reason from the current state to the past 
under some observations. It has been realized that observations on any intermidate 
states (including the final state) play an important role in temporal postdiction 
( lablonowski et al, 1996 ). Here, an observation is viewed as an agent's beliefs about 
the domain that is either obtained from the outside world or from the agent's own 
assumption. In the case that an agent makes an observation under some assumption, 
such observation becomes defeasible because once the assumption is proved not to 
be true, the agent's observation should be defeated. 

In this section, we extend AT to AT 1 such that the extended language can 
handle temporal prediction and postdiction where both defeasible constraints and 
observations are admitted. 



4.1 Syntax of AT 1 

The syntax of AT 1 is the same as AT 0, s except that AT 1 also has an observation 
proposition of the form: 

L is observed if L\, ■ ■ ■ , L m with absence L m+ i, ■ ■ ■ ,L n after A\, • • • , Ai, (20) 

where L, L\, ■ ■ ■ , L n are fluent expressions, and A%, ■ ■ ■ , A\ are actions. Intuitively, 
( pp[ ) says after actions A\, - ■ ■ ,Ai are performed sequentially, L is observed to be 
true if Li, ■ ■ ■ , L m are true while L m+ i, ■ ■ ■ , L n are absent. Obviously, (EOh represents 
a kind of defeasible information. In the case that no action occurs in (|20|), ( p0| ) can 
be written as the following form: 

initially L is observed if L\, ■ ■ ■ , L m with absence L m+ i, ■ ■ ■ ,L n ■ (21) 

Under the language AT , we define a domain description V to be a finite set of 
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observation propositions, causal propositions and action effect propositions. AT 
will still have the value proposition (^) and its special case the initial proposition 
, but are only used as query statements in AT . 

Example 5 

Let us consider a modified shooting action scenario which we name Shooting-l. 
Suppose the turkey is observed alive in the initial situation, and as there is no 
explicit information about whether the gun is loaded in the initial situation, the 
agent would assume that the gun is initially not loaded by default. After actions 
Shoot and Wait are successively performed, it is observed that the turkey is dead 
(not alive). This scenario can be naturally described by language AT 1 . In particular, 
we specify a domain description V {Shooting-l) which has the following observation 
propositions: 

initially Alive is observed, 

initially -^Loaded is observed with absence Loaded, 
-^Alive is observed after Shoot, Wait, 

and an action effect proposition: 
Shoot causes -lAlive if Loaded. 

4.2 Semantics of AT 1 

We will use a similar way as described in section 3.2 to develop a formal semantics 
of AT 1 based on a transition system that is defined on the basis of the translation 
from a AT 1 domain description into a PLP. 

4-2.1 Translating AT 1 into PLP 

As we have mentioned earlier, the major improvement from AT to AT 1 is that 
we allow defeasible observations to be presented in a domain description so that 
temporal postdiction becomes possible. It is quite straightforward to translate an 
observation proposition of the form ( pfj| ) into the following logic rule: 

[->]Holds(F, S) <- [-]Holds(F 1 ,S),---,[-]Holds(F m ,S), 

not [-]Holds(F m+1 ,S), ■ ■ ■ ,not [^}Holds(F n , S), (22) 

where S — Result{Ai, Result(- ■ ■ , Result (Ai, So) ■ ■ •)). 

To do postdiction reasoning, for each action effect proposition in T>, we need to 
have some action explanation rules which will be used to derive action precondi- 
tions based on proper observations. First, if there is an action effect rule ([l3|), the 
following rule explains that the fact Holds(F, Result {A, s)) is caused by performing 
action A: 

Effect + (F,Result(A,s)) <- Holds(F, Result (A, s)),not Holds(F, s), 

not Caused + (F,Result(A,s)), (23) 

Clearly, the function of rule ( p3|) is to identify action A's actual execution. The 
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intuition is that if fluent F is true (or false, resp.) in situation Result(A, s), and 
there is no explicit information saying that F is true in the previous situation s or 
F is caused to be true by some causal rule, then it derives that F's truth value in 
Result (A, s) is a direct effect of action A. 

Furthermore, if a fluent F is a direct effect of some action A, i.e. Effect + (F, Result(A, s)) 
holds, then each precondition of A must also hold in the previous situation. That 
is, we should have rules like: 

[->]Holds(Fi, s) <- Effect + (F,Result(A,s)), (24) 

where i = 1, • • • , k and A causes F if [ _i ]i 7 'i, • ■ • , H-Ffc is an action effect proposition 
in domain T>. However, it should be noted that sometimes one action may cause the 
same effect under different preconditions. In this case, deriving all possible action 
preconditions may cause contradictions. For instance, consider the following domain 
description D(Door): 

initially -i HasKey, 

DoorOpened is observed after OpenDoor, 
OpenDoor causes DoorOpened if HasCard, 
OpenDoor causes DoorOpened if HasKey. 

In this domain, action OpenDoor has two independent preconditions HasCard and 
HasKey. If we translate this domain according to our proposal above, we will have 
the following logic rules: 

Holds(Door Opened, Result(OpenDoor, So)) <— , 

Effect + (DoorOpened, Result(OpenDoor, s)) «— 

Holds(Door -Opened, Result(OpenDoor, s)), 
not Holds(DoorOpened, s), 

not Caused + (DoorOpened, Result(OpenDoor, s)), 

Holds(HasCard, s) «— Effect + (DoorOpened, Result(OpenDoor, s)), and 
Holds(HasKey , s) <— Effect + (DoorOpened, Result(OpenDoor, s)). 

From the above logic rules, we will deduce both Holds(HasCard, So) and 
Holds(HasKey, So). But from V(Door), we know that ->HasKey initially holds. To 
avoid this kind of contradiction, instead of using rule (p4|), we should have a weaker 
rule to derive action preconditions: whenever there is no conflict, we only deduce 
a minimal number of preconditions to explain an action. Under this principle, wc 
will change rule (|24|) to the following form: 

[-^\Holds(F l ,s) «- Effect+(F,Result(A,s)), 
not[-<]Holds(Fi, s), 

not[-n]Holds(F[, s), ■ ■ ■ , not[-^\Holds(F' l , s), (25) 

where i = 1, • • ■ , k, and fluents [-^)F[, - ■ ■ [~]F[ occur as preconditions in all other 
action effect propositions of A that have the same effectR 



[- 1] Holds (Fi, s) denotes the complementary literal of [-^]Holds(Fi, s). 
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The following rules represent the dual case of rules (|23|) and (25) corresponding 
to action effect rule (|l4|): 

Effect" (F ', Result{A, s)) <- Holds(F,Result(A,s)),not Holds(F,s), 

not Caused + (F,Result(A,s)), (26) 

[->]Holds(Fi,s) <- Effect- (F, Result ( A, s)), 
not[—]Holds(Fi , s), 

not[-]Holds(F{, s), • • ■ , not[->]Holds(F' h s) ■ (27) 

Now the following definition describes the formal translation from a domain de- 
scription V of AT 1 into a PLP . 

Definition 8 

A PLP is called a translation of domain description V of AT 1 , denoted by 7-'- 4T (V) = 
(II, A/", <), if it is obtained as follows: 

1. II consists of the following rules: 

Observation rules : for each observation proposition of ( |20| ) , there is a rule of 
the form @, 

Causal rules : the same as in Definition 4, 
Action effect rules : the same as in Definition 4, 

Action explanation rules: for each action effect rule (|l3|), there are rules (^3|) 



and d25|) , and for each action effect rule (|14|), there are rules (|26j) and fl27|), 



Inertia rules : ([17j), (J18J) and: 

Holds(f,s) <— Holds (/ , Result(a, s)), not ->Holds(f , s), 
not Caused + (/ , Result(a, s)), 

not Effect + (f,Result(a,s)), (28) 

->Holds(f,s) <— ->Holds(f , Result(a, s)), not Holds(f , s), 
not Caused" (/ , Result(a, s)), 

not Effect- (f, Result{a, s)) ■ (29) 

2. Naming function A/" assigns a unique name to each rule in II; 

3. For each observation rule -A G , causal rule A c , action explanation rule 7V e:E and 
inertia rule iVj, the following <-relations hold: 

# e;c <N c <Ni<N - (30) 

Compared with Definition 4, the PLP translation specified in Definition 8 presents 
several new features. First, V Ar (T>) allows to represent defeasible observations 
not only at the initial situation but also at any other situations. Second, V Ar (T>) 



includes action explanation rules (|23|), Q25j), fl26|), and (27). Finally, the extra inertia 
rules (^8|) and ( |29| ) allow us to reason about fluents' truth values from the current 
situation to the past. That is, if a fluent / is true (or false, resp.) currently, and 
there is no explicit information saying that / is not true (or not false, resp.) in the 
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previous situation, or / is caused to be true by some causal rule, or / is true (or 
false, resp.) as a direct effect of some action, then it derives that / is true (or false, 
resp.) in the previous situation. 

Since both observation and action explanation rules may be defeasible, more 
possible conflicts may occur in V AT (T>). For instance, conflicts may not only occur 
between causal rules and inertia rules, but also between action explanation rules 
and inertia rules, observation rules and causal rules, etc.. To solve these possible 
conflicts, the underlying <-relation is specified as (p0[). ( |30| ) presents that action 
explanation rules are most preferred because the execution of an action usually 
override defeasible causal and inertia rules, while observation rules are less preferred 
than inertia rules due to the intuition that a fluent's truth value normally persists 
if there is no explicit action or causal rule to change it. 

Note that in V ATl (T>), action explanation rules (^3|), (f25|), ( p6| ) and ( f27|) are 
domain specific because they are specified based on action effect rules ( |l3|) and ( |l4| ) . 
On the other hand, the new inertia rules (E8h and (|2^) are domain independent. 
Therefore, we can denote domain independent rules in r p- A ' T (T>) as follows: 

nLd = Rind u n| nd u rrj nd , (31) 

where H\ nd — II \ nd U {(p8j), (p9|)}- We also denote the set of domain specific rules 
oiV AT \v) asni pec . 

Example 6 

Example 5 continued. According to Definition 8, it is not difficult to obtain the 

translation of domain description V(Shooting-l), V AT (Shooting-1) = (nj U 

Ii] nd ,Af, <), where Hl pec consists of the following rules: 

Observation rules : 

Nt : Holds(Alive, S ) <—, 

N2 : ->Holds(Loaded, So) <— not Holds(Loaded, So), 
N3 : -1 Holds {Alive, Result (Wait, Result(Shoot , So))) <— , 
Action effect rules : 

-/V4 : Effect^ (Alive, Result(Shoot, s)) <— H olds (Loaded, s), 
Action explanation rules: 

N§ : Effect~ (Alive, Result(Shoot, s)) <— Holds (Alive, Result(Shoot, s)), 

not Holds (Alive, s), 
not Caused^ (Alive, Result(Shoot , s)), 

Nq : H olds (Loaded, s) <— Effect^ (Alive, Result(Shoot, s)), not->Holds(Loaded, s). 
Naming rules in H} nd : 

Assigning a unique name to each rule in II* d . Therefore, we have names N-j, iVg, Ng, N\o, 
N n , N 13 , N 13 , and N u for rules ©, ©, ©, (0), ©, © and © re- 

spectively. 
<-rclations : 

N ex < N t < N Q . That is, we have: 

{N 5 ,N 6 } < {N n ,N 12 ,N 13 ,Nu} < {N^N^Ns}. 
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4-2.2 Transition function, models and entailment 

Transition function 1Z, structures and models ^ are defined in the same way as 
in AT (see section 3.2.2). We denote the entailment relation under ^ in AT 1 
as 1=^47-1 . The only thing we should emphasize is that since we allow a domain 
description to include defeasible initial observation propositions, it is possible that 
one initial observation proposition conflicts with the other. Therefore, different 
initial states So may be deduced from different answer sets of the corresponding 
translated PLP V AT ^ (V) of V. 

Example 7 

Example 5 continued. In the shooting action scenario as described in Example 5, 
the question we are interested in is when the turkey died and whether the gun 
was actually loaded initially. This is a question about postdiction that we need 
to reason from the current situation to the past. After translating the domain 
description T)(Shooting-l) into V Ar (Shooting-1) as illustrated in Example 6, we 
obtain the following results: 

T>(Shooting-l) \=at! -> Alive after Shoot, 
V(Shooting-l) \=at x initially Loaded, 

where the first solution says that the turkey was dead after the execution of action 
Shoot, and the second indicates that initially the gun was actually loaded, which 
defeats the original observation. 



5 AT : Representing Actions with Defeasible and Abnormal Effects 

It is common that in temporal reasoning under some circumstances, an action 
might be abnormally executed and the original expected action effect is defeated. 
Sometimes, an abnormal effect associated with this action may be also produced. 



Consider the classic shooting scenario, e.g. (Sandewall, 1994), in which it is usually 
assumed that if the gun is loaded, then the shoot action causes a direct effect that 
the turkey is not alive. However, it is probably more natural to treat shoot as a 
defeasible action. For instance, if the bullet is dumb, the turkey would be still alive 
after executing action shoot, or it could be an abnormal effect of shoot if after 
shooting the turkey is still alive but the pigeon is dead. In this section, we try to 
further generalize our action language AT 1 to AT 2 in order to capture actions with 
defeasible and/or abnormal effects as described above. 



5.1 Syntax of AT 2 

AT 2 includes the same forms of observation propositions, causal propositions, and 
value propositions of AT , but has different forms of action effect propositions. 
First, an action effect proposition of AT 2 is of the following form: 



A normally causes L if L±, ■ ■ • , Lk, 



(32) 
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where A is an action and L, L\, ■ ■ ■ , Lk are fluent expressions. Intuitively, this action 
effect proposition is defeasible since we consider that if an action is abnormally 
executed, its normal effect then cannot be produced. 

Therefore, the following action abnormal effect proposition represents the abnor- 
mal effect of an action: 

A abnormally causes L if L\, ■ ■ ■ , ■ (33) 

Finally, an abnormal condition proposition represents the condition under which 
an action can be considered to be abnormal: 

A is abnormal if before Li, • • • ,Lh after Lh+i, ■ ■ ■ , L p ■ (34) 

Usually, the abnormality of an action can be identified from observations on the 
changes of some particular fluents' truth values before and after the action execu- 
tion. Hence, (|34| ) says that if L\, ■ ■ ■ ,Lh are true before action A is executed, and 
Lh+i, ■ ■ ■ , L p are true after action A is executed, then A is identified to be abnormal. 

A domain description V of AT 2 is a finite set of observation propositions, causal 
propositions, action effect propositions, abnormal action effect propositions, and 
abnormal condition propositions. The following example shows how we can use 
AT 2 to represent domains where actions may have abnormal or/and defeasible 
effects. 

Example 8 

Let us consider a different shooting scenario named Shooting-2 in which action 
Shoot has a defeasible effect and it is abnormally executed if initially the gun is 
loaded and after performing the action, the turkey is observed still alive. Initially the 
gun is loaded and turkey is alive. This scenario is easy to formalize by using AT 2 . 
We specify a domain description V {Shooting -2) that has the following observation 
propositions: 

initially Loaded is observed, 
initially Alive is observed, 

a defeasible action effect proposition: 

Shoot normallly causes -i Alive if Loaded, 
and an abnormal condition proposition: 

Shoot is abnormal if before Loaded after Alive. 

5.2 Semantics of AT 2 

Similarly to previous languages AT and AT 1 , we will propose a transition system 
to provide a formal semantics of AT 2 . Again, this transition system is defined based 
on a translation from a domain description of AT 2 into a PLP. 
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5.2.1 Translating AT into PLP 

To translate an action domain of AT 2 , we need to extend the language H P Ar a of 
PLPs introduced in section 3.2 to a new language C^T 2 °^ PLPs by adding following 
symbols: 

- Ab: a binary predicate symbol taking arguments action and situation respec- 
tively. 

- AbEffect + and AbEffect' : binary predicate symbols taking arguments fluent 
and situation respectively. 

Intuitively, atom Ab(a, s) expresses that action a is abnormally executed at sit- 
uation s, while atoms AbEffect + (/ , s) and AbEffect+ (/ , s) are used to represent 
abnormal effects of actions (see the following for detail). 

Considering the dcfcasibility of action executions, we need to modify our original 



action effect rules (13) and ( |14| ) presented in section 3.2 to the following forms 
respectively: 

Effect + (F, Result{A, s)) <- [^]Holds{F u s), ■ ■ ■ , [->}Holds(F k , s), 

notAb(A,s), (35) 

Effect' (F, Result ( A, s)) <- [->]Holds(F u s), ■ ■ ■ , [~]Holds(F k , s), 

notAb(A,s), (36) 

Rule ( p5| ) (or ( |36"| ) resp.) says that if A's preconditions [-■] Holds (Fi, s), • • • , \-^Holds{F k , s) 
hold, and there is no explicit information stating that A is abnormally executed at 
situation s, then fluent F will be true (or false, resp.) in situation Result(A, s) as a 
direct effect of A. Additionally we also need a generic schema for any action a: 

-iAb(a,s) «- not Ab{a,s), (37) 

which simply expresses that if there is no explicit information saying that action A 
is abnormally executed at situation s, then it is assume that A is not abnormally 
executed at situation s. To simplify our following presentation, we denote 

m£ = n& u {©r- 



Consequently, action explanation rules (23) and (^) in AT 1 are also modified 
as follows respectively: 

Effect + (F,Result(A,s)) <- Holds(F , Result{A, s)), not Holds(F , s), 

not Caused + (F , Result(A, s)), 
not Ab(A,s), (38) 

Effect' (F , Result (A, s)) <- ^Holds{F , Result {A, a)), not ^Holds{F , s), 

not Caused' {F , Result (A, s)), 
notAb(A,s)- (39) 
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states that if fluent F is true in situation Result(A, s), and there is no evidence 
to show that (a) F is true in the previous situation s; (b) F is caused to be true 
in situation Result(A, s); and (c) action A is abnormal at situation s, then it is 
derived that F must be a direct effect of action A in situation Result(A, s). ( p9| ) 
has a similar interpretation. 

As we mentioned earlier, some actions with defeasible effects may also produce 
abnormal effects. Hence, we also specify action abnormal effect rules of the following 
forms: 



AbEffect + (F, Result{A, s)) 
AbEffect~ (F , Result (A, s)) 
Holds(f,s) <- AbEffect+(f,s), 



[^}Holds(Fi,s),- 
Ab(A,s), 



n]Holds(Fi,s), 



(40) 



-•Holds (f,s) 
Ab(A,s) «- 



AbEffect-(f, 



[-n]Holds(F u s), [-^} Holds (F t , s), 
Ab(A,s), (41) 

(42) 

(43) 



-\Holds(F u s), [-^}Holds(F h , s), 

i]Holds(F h+1 , Result(A, s)), [-^}Holds(F p , Result (A, sQ|4) 



Basicly, rule (|4C| ) (or (|4l| ) resp.) says that if conditions [-i]77oZds(Fi, s), • • •, 
\-]Holds(Fi, s) hold and A is abnormally executed, then fluent F will be true (or 
false resp.) as an abnormal effect of A in situation Result(A, s). Rule (H), on the 
other hand, is a direct translation of abnormal condition proposition (|3J). Clearly, 
rules ( f42|) and ( ^3|) are domain independent while rules (0), (£ll|) and ( ^i[ ) arc 
domain specific. Again for simplicity, we denote 

n£ d = {©,©}• 

Now we are able to describe our translation from a domain description of AT 2 
into a PLP as follows. 

Definition 9 

A PLP is called a translation of domain description V of ^4T 2 , denoted by (T>) = 
(II, N , <), if it obtained as follows: 

1. II consists of following rules: 

Observation rules : the same as in Definition 8. 
Causal rules : the same as in Definition 8. 
Action effect rules : for each action effect proposition 



2f), there is a rule of 



the form ( |35| ) or (36). Three domain independent action effect rules (|1 
and ( p7| ) are also included in this set. 

Action explanation rules: for each action effect rule of the form ( J35[ ) 
are rules ([38]) and (Ba), and for each action effect rule of the form (pq) 
are rules (B9) and (p7|), 
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Action abnormal effect rules : for each action abnormal effect proposition (33) 



there are rules 
there is a rule 
Inertia rules : (I 

Holds(f , s] 




3|), and for each abnormal condition proposition (34) 



), (|isp and: 

Holds(f , Result{a, s)), not ->Holds(f , s), 
not Caused + (/ , Result(a, s)), 
not Effect + (/ , Result (a, s)), 
not AbEffect + (/ , Result (a, s)), 



(45) 



->Holds(f,s) <— ->Holds(f , Result(a, s)), not Holds(f , s), 
not Caused - (/ ', Result (a, s)), 
not Effect~ (/ , Result(a, s)), 
not AbEffect~~ (/ , Result{a, s)); 



(46) 



Naming function TV assigns a unique name to each rule in II; 

For each observation rule N a , causal rule N c , action effect rule N e ff, action 

explanation rule N ex , and inertia rule N , there are <-relations ([30|) : 



and 



N ex < N c < Ni < N 



N eff < N c < Ni < N 



(47) 



Note that inertia rules (^) and ( |46] ) are a natural extension of inertia rules ( p8[ ) 
and (29) in V AT (V) respectively. The <-relations in AT 2 are specified in a similar 
way as in ~P AT (£>) except one more schema (^7|) is added. This is because in 
pAT (jy-j ac ^j on effect rules (Q) and (^6|) are defeasible, possible conflicts between 
these rules and other defeasible rules (e.g. causal rules, inertia rules and observation 



rules) may occur indirectly through the action abnormal effect rule (44). Therefore, 
<-relations (^) (see section 4.2) and (^) are needed as we always assume that an 
action's successful execution should have the highest priority. 
We denote domain independent rules in V Ar (T>) as follows: 



where 

n 2 

spec 



n 2 md = u n£,' u u± u n 



ind 



IE 



ind 



ind i 



(48) 



U 



51) , (fi6j)}, and denote the set of domain specific rules by 



5.2.2 Transition junction, models and entailment 

Transition function 1Z, structures and models $ of AT 2 are defined exactly the 
same as in section 3.2.2. The entailment relation under 5* in AT 2 is denoted as 
^at 2 - Again, it is observed that the initial state of a domain description T> of 
AT may not be unique due to a possible conflict occurring between two defeasible 
initial observation propositions in T>. 
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Example 9 

Example 8 continued. Given the domain description T>(Shooting-2) as presented in 

Example 8, the translated PLP V Ar (Shooting-2) is easy to be obtained according 

to Definition 9. Let V Ar ' (Shooting-2) = (U 2 spec U H 2 nd , TV, <), where U 2 spec consists 

of the following rules: 

Observation rules : 

Ni : Holds (Loaded, So ) <— , 

N 2 : Holds (Alive, S ) <-, 

Action effect rule: 



N% : Effect^ (Alive, Result(Shoot, s)) 
Action explanation rule: 

: Effect^ (Alive, Result(Shoot, s)) 



Action abnormal effect rule: 



N 5 : Ab(Shoot,s) 



Holds(Loaded, s), not Ab(Shoot, s), 

-i Holds (Alive, Result (Shoot , s)), 
not -i Holds ( Alive, s), 
not Caused^ (Alive, Result(Shoot , s)), 
not Ab(Shoot, s), 



Holds(Loaded, s), Holds(Alive, Result(Shoot , s)), 



Naming rules in H 2 nd : Assigning a unique name to each rule in H 2 nd ■ 

< 



-relations : Q and (gjj). 



Since the action effect rule N3 is defeasible, it is not difficult to see that a conflict 
on the truth value of Holds(Alive, Result(Shoot, So)) occurs between rule N$ and 
an inertia rule 

N' : Holds(Ahve, Result(Shoot, s)) <— Holds(Alive, s), 

not -1 Holds (Alive, Result(Shoot, s)) 

which is an instance of the generic inertia rule (17) included in nf„ rf p|. However, 
this conflict is solved by N3 < N'. Therefore, we have the final result 

T> (Shooting-2) \=^t 2 ^Alive after Shoot, 
from which it is concluded that action Shoot is not abnormally executed. 



6 Characterizations of Action Domains 

Among all action domains specified by languages AT , AT 1 and AT 2 , there are 
some classes of action domains that may have more desirable properties than other 
classes of domains. In this section, we investigate these desirable properties and 
characterize different action domains within languages AT , AT 1 and AT 2 respec- 
tively. 

11 Note that the conflict is introduced through rule N$. 
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In particular, we will explore the following questions that are important for eval- 
uating an action formulation: (a) How can we decide whether an action domain 
description is consistent (has a model)? (b) Given an action domain description, 
how is a fluent's truth value affected by executing some acion(s)? (c) Under what 
conditions does the reasoning within an action domain become monotonic? and (d) 
Is it possible to characterize a set of fluents that are temporally definite with respect 
to the underlying action domain description? For instance, if fluent F's truth value 
is known currently, will its truth value be also known after some action or action 
sequence is executed? Furthermore, we will also discuss how to improve our action 
formulation to handle domain dependent preferences so that they can be suited for 
more general cases in reasoning about action. 



6. 1 Consistency of action domains 

In this subsection, we consider the problem of how we can decide if a domain 
description is consistent (has a model). In our semantics development, the transition 
function 1Z is defined based on a translation from the underlying domain description 
I? to a PLP. Hence, it is not difficult obtain a general PLP characterization for 
consistent domain descriptions. 

Proposition 1 

Let T> be a domain description of AT 1 and V Ar (T>) [i = 0, 1,2) the corresponding 
PLP translation of T> specified previously. T> is consistent if and only if V Ar (T>) 
has a consistent answer set. 

Proposition 1, however, can not always be used as a feasible way to decide the 
consistency of a domain description because in general deciding whether a PLP 



has an answer set is NP-complete (Zhang, 2001 J 12 . So it is important to study 



syntactic characterizations on different cases. Our investigation on this issue starts 
from language AT . 



6.1.1 Characterizing consistent action domains of AT 

Given a domain description T> of AT , we first introduce the following notions: 

initial = {F I initially* 1 G Dj, 

initial = {F I initially^ G V}, 

^Effect = i F I A causes F if L\, • • • , L m e V}, 

^Effect = {F I A causes^F if Li, ■ ■ ■ , L m e V}, 

^Cause d = {F\F is caused if...eX>}, 

^Caused = {F\^F is caused if ■ • • G V}. 

For convenience, we use Tj nitial to denote the set containing those complementary 
elements of Tj nitial . That is, 



Note that deciding whnthnr a. n extended logic program has an answer set is also NP-complete 
( [Vlarek fc Truszczynski, 1993 ). 
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J~ Initial {~<F \ F £ T ' i nitia i\ ■ 

Similar notations may be used for other sets, e.g. ^F^ffecv -^Caused^ e ^ c -- 
Definition 10 

Given a domain description T> of AT , two fluent expressions L and L' are mutually 
exclusive in T> if: 

L £ Initial U 3~ 'initial Effect U ^ ' Effect Caused U ^ ' Caused) implies 

L' 4. (Tt .« ,u7,"r i UFt- , u K™ , uK .uk; ,). 

r- V Initial Initial Effect Effect Caused Caused/ 

Intuitively, if two fluent expressions are mutually exclusive, it means that these 
two fluent expressions cannot be both true in any state. Based on the concept of 
mutual exclusion, we will provide a sufficient condition to decide the consistency 
of a domain description. Before we present the result, we need to introduce further 
notions. For a domain description X>, we assign a unique label I to each proposition 
in V so that we can use I to refer a proposition in T>. Let I be a causal or action 
effect proposition in T>. That is, I has one of the following forms: 

L is caused if L\, ■ ■ ■ , L m with absence L m +i, ■ ■ ■ , L n , or 
A causes L if L\, ■ • • , L m . 

We use pre(l), default(l) and eff(l) to denote the set {L\, • • ■ , L m }, {L m+ \, • • • , L n } 
and {L} respectively. Clearly, default(l) = if Z is an action effect proposition or 
the causal proposition does not include absent fluent expressions. For the case that 
I is an initial proposition initially L, pre(l) — default(l) — and eff(l) = {L}. 

Definition 11 

Given a domain description T> of AT . Two propositions / and /' in T> are comple- 
mentary if one of the following conditions holds: 

(i) both I and I' are causal propositions, and eff(l) is a complement of eff(l'); 

(ii) / is a causal proposition, I' is an action effect proposition, and eff(l) is a 
complement of eff(l'), i.e. 

I: F is caused if Zi, ■ ■ ■ , L m with absence L m +i, • • ■ , L n , 
I': A causes -*F if ii, • • ■ , L' k ; 

(iii) both I and /' are action effect propositions of the same action, and eff(l) is a 
complement of eff(l'), i.e. 

I: A causes F if L\, • • ■ , Lh, 
I': A causes ->F if Ii, ■ ■ ■ , L' k . 

Definition 12 

Given a domain description V of AT . V is normal if V satisfies all of the following 
conditions. 

W -^Initial ^ 3~ 'initial ~ ^' 

(ii) For any two causal propositions l\ and Iq in T>, 
effik) n pre(k) = and 
default(l i )neff(l j ) = <D (i,j = l, 2)0; 

13 Note that this condition includes default(li) fl eff(li) = (i = 1,2). 
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(iii) For any pair (1,1') of complementary propositions in T>, there is a pair of 
fluent expressions (L, L') in V such that L and V are mutually exclusive, 
where L £ pre(l) and U £ pre(l'). 

Let us explain the intuition behind a normal domain description in some details. 
Condition (i) ensures a consistent initial state deduced from the domain description 
V. Condition (ii), on the other hand, says that for each causal proposition in T>, 
the complement of its effect should not occur in its preconditions, and furthermore, 
the effect of this causal proposition does not occur in the absence component (i.e. 
the default part) of all other (including itself) causal propositions in V. Finally, 
Condition (iii) represents a non-trivial restriction for complementary propositions 
in V. Since two complementary propositions may cause two complementary fluents 
to be true in some state, this condition actually indicates that if there are two 
complementary propositions in the domain description, then the effects of these 
two propositions cannot be both true in any state. The following theorem gives a 
sufficient condition to guarantee a domain description to be consistent. 

Theorem 2 

Every normal domain description of AT is consistent. 

6.1.2 Characterizing consistent action domains of AT 1 and AT 2 

Now we try to investigate an analogue of Theorem 2 for AT 1 and AT 2 . As AT 2 is 
viewed as an extension of AT 1 , here we only need to consider domain descriptions 
of AT 2 . To achieve our purpose, we must modify the concept of mutual exclu- 
sion of fluent expressions in order to cover observation and abnormal action effect 
propositions in a domain description that are not allowed in AT . In particular, we 
define 

T± = {F | L is observed if • • • after A}, 
= {^F | L is observed if ■ ■ ■ after ^4}. 

As a special case, is formed based on initial observation propositions of V. Let 

^Observe =U-^> and 
Observe ~ U ^4 1 

where each action string A occurs in some observation proposition of V. Under the 
context of AT 2 , we also redefine the following notions: 

■^Effect ~ {T \ A normally causes F if • • •} U 

{F I A abnormally causes F if • • •}, 
•^Effeet ~ {F I A normally causes ->F if ■ ■ ■} U 

{F I A abnormally causes ->F if ■ ■ ■}. 

Given domain description V, we use label / to (uniquely) refer to an observation 
proposition, causal proposition, action effect proposition, or action abnormal effect 
proposition. Then notions pre(l), default (I) and eff(l) arc defined in an obvious 
way. Two fluent expressions L and U are mutually exclusive in V if 
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L G (^Observe U ^Observe ^Effect U ^Effect ^Caused U ? ' Caused) implies 
^ i Observe U 3~ Observe ' Effect U J~ Effect Caused U 3~ Caused)' 

Finally, we should also modify the definition of complementary propositions as 
follows. 

Definition 13 

Given a domain description D of .4T 1 or *4T 2 . Two propositions I and /' in T> are 
complementary if one of the following conditions holds: 

(i) both I and i' are causal propositions and eff(l) is a complement of eff(l'); 

(ii) / is a causal proposition and I' is an action effect or abnormal effect proposition 
and eff(l) is a complement of eff(l'); 

(iii) both I and I' are action effect propositions of the same action where eff(l) is 
a complement of eff(l'), i.e. 

Z: A normally causes F if L\, ■ ■ ■ , L/,, 
Z': ^4 normally causes -iF if • • • , L' k ; 

(iv) both I and /' are action abnormal effect propositions of the same action and 
eff(l) is a complementary of eff(l'), i.e. 

I: A abnormally causes F if L\, ■ ■ ■ , L^, 
I': A abnormally causes ->F if L[, ■ ■ ■ , Z/ fc . 

The following definition then extends the concept of normal domain description 
to AT 1 and AT 2 . 

Definition 14 

Given a domain description V of AT 1 or AT 2 . T> is normal if V satisfies all of the 
following conditions. 

(i) For any action string A occurring in observation propositions of V, JT+nJF^ = 

0; 

(ii) For any two observation or causal propositions l\ and I2 in T>, eff(li)Dpre(k) = 
and default(k) n eff{l 3 ) = = 1,2); 

(iii) For any pair (Z, I') of complementary propositions in T>, there is a pair of 
fluent expressions (L, L') in V such that L and V are mutually exclusive, 
where L e pre(l) and 1/ e pre(l'). 

Theorem 3 

Every normal domain description of AT 1 or AT 2 is consistent. 

6.2 Cause of change on fluents' truth values 

In the rest of the paper, our discussion will focus on consistent action domains. 
First, the following theorem illustrates a basic property of any (consistent) action 
domain of AT showing that a fluent's truth value can only be affected by some 
action effect proposition or causal proposition. 



Theorem 4 
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Let I? be a consistent domain description and V (T>) the corresponding PLP 
translation of V. Then the following results hold. 

(i) If V ^ AT o F after A ■ A and V ^ AT o F after A, then 

V AT "(V) \= Effect+(F,Result(A,S)) or V AT ° (V) \= Caused+(F, Result {A, 5)), 
where S = Result (Ai, • • • , Result (Ai, So) • ■ ■) and ~A = Ai ■ ■ ■ AQ 

(ii) If D h^r" -'■f after 1 • 4 and V ^ AT o ->F after A, then 

T AT \v) \= Effect- (F,Result(A,S)) or V Ar \v) \= Caused' {F , Result{A, S)). 

While the intuition of Theorem 4 is quite clear, it, however, does not hold for 
action domains of AT 1 and AT 2 since observation propositions of the form ( p0| ) is 
allowed in a domain description of AT 1 or AT 2 that may override an inertia rule 
in the corresponding PLP translation and present a change of a fluent's truth value 
even if there is no action or causal rule to cause such a change. In this case, we 
may think that either the fluent's truth value is changed by some external event 
that is not described in the domain description or the domain description is not 
properly specified. Weaker results may be obtained for domains of AT 1 and AT 2 
under some restrictions. 

Theorem 5 

Let T> be a consistent domain description of AT 1 and V Arl (T>) the corresponding 
PLP translation of T>. Suppose each observation proposition in D has the form 

L is observed if Li, ■ ■ ■ , L m with absence L, L m +i, ■ ■ ■ ,L n after A, 

where A is not an empty string of actions. Then the following results hold. 

(i) If V \= AT i ->F after A, and V \= AT i F after A ■ A, then 

V ATl (V) h Effect+{F, Result(A, S)) or T ATl (V) \= Caused+{F, Result(A, S)); 

(ii) If V \= AT i F after A and V \= AT i ->F after A ■ A, then 

V AT \V) |= Effect- (F,Result(A,S)) or V AT \v) \= Caused' (F ', Result{A, S)). 

Theorem 6 

Let T> be a consistent domain description of AT 2 and V at2 (V) the corresponding 
PLP translation of T>. Suppose each observation proposition in T> has the form 

L is observed if L\, ■ ■ ■ , L m with absence L, L m +i, • ■ ■ ,L n after A, 

where A is not an empty string of actions. Then the following results hold. 

(i) If V ^_4r 2 after A, and T> \= A r 2 F after A ■ A, then one of the following 
results holds: 

V AT \V) (= Effect+(F,Result(A,S)); 
V AT {V) (= AbEffect+(F,Result(A,S)); or 
V AT (V) \= Caused+(F,Result{A,S)); 



Without further explanation, this notion is also used in our other statements presented in this 
section. 
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(ii) If T> ^=_4t 2 F after A, and T> H.4T 2 after A ■ A, then one of following 
results holds: 

T Ar \v) \= Effect' {F,Result(A,S)); 
V Ar \v) \= AbEffect-{F,Result(A,S)); or 
V AT {V) \= Caused- (F, Result (A, S)). 

6.3 Restricted monotonicity 

Monotonicity is a desirable property for reasoning about action in the sense that 
whenever new domain specific information is added to a domain description, no 
previous conclusion will be retracted. However, it is well known that most current 
action formulations are nonmonotonic in general. In this subsection, we investigate 
some restricted monotonicity for action domains. Formally, let V be a domain 
description of AT , AT 1 , or AT 2 . A domain description V is called an augment 
of T> if T> C D' and the only extra propositions in V are observation propositions 
(or initial propositions in the case that T> and T>' are domain descriptions of AT ). 

Definition 15 

A domain description V of AT 1 (i = 0, 1, 2) is monotonic with respect to observa- 
tions (we also simply call O-monotonic) if for each augment V of V, V \=at' L 
after A implies V \= AT i L after A (i = 0, 1, 2). 

It is clear that in general O-monotonicity does not hold for any domain descrip- 
tion V due to a possibility that in the PLP translation of V , some new added ob- 
servations may defeat previous conclusions derived through defeasible causal rules, 
incrtial rules, action effect rules or action explanation rules. As an alternative, we 
can investigate proper restricted conditions under which O-monotonicity holds. 

Theorem 7 

Let I? be a domain description AT . T> is O-monotonic if 

(i) each causal proposition in T> is of the form 
L is caused if Li, • • • , L m , and 

( U ) -^initial ^ Effect Caused) = ^' 

initial n Effect paused) = 9, and 

(■F Effect U F Caused) n Effect U ^ Caused) = ®- 

Intuitively, Theorem 7 says that to guarantee a domain description V of AT 
to be O-monotonic, (i) all causal propositions in T> should be non-defeasible, and 
(ii) all fluents involved in initial propositions, action effect propositions and causal 
propositions should be irrelevant in such a way: fluents involved in positive (or 
negative, resp.) initial propositions should be disjoint with fluents involved in neg- 
ative (or positive, resp.) action effect and causal propositions, and fluents involved 
in positive action effect and causal propositions should be disjoint with fluents in- 
volved in negative action effect and causal propositions. Let V Ar (T>) be the PLP 
translation of V. Condition (i) is necessary since this follows that adding any new 
initial fact rules in V Ar (V) will not defeat any fact Holds(F, S) or ^Holds(F, S) 
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that is drived through some causal rules at the initial situation Sq. Condition (ii), 
on the other hand, guarantees that initiating any action effect rules or causal rules 
by adding new initial fact rules into P AT (£>) will not affect any previous facts 
drived through old initial fact rules, action effect rules, or causal rules. 

An analogous result of Theorem 7, however, does not hold for domain descriptions 
of AT 1 and AT 2 . In fact, since both AT 1 and AT 2 allow domain descriptions to 
have observations not only at the initial state but also at any other intermediate 
states, the property of O-monotonicity is hard to be achieved. For instance, consider 
the PLP translation P ATl (V) of a domain description V of AT 1 , if V ATl (V) |= 
Holds(F , S) and Holds(F, S) is derived through instances of action explanation 
rules © and © in V Ar \v): 

Effect + (F',Result(A,S)) <- Holds(F' ', Result(A, S)),not Holds{F' , S), 

not Caused + (F' , Result (A, S)), (49) 

Holds(F,S) «- Effect + (F',Result{A,S)),not^Holds(F,S), (50) 

then adding a new observation rule Holds (F' , S) <~ into V AT (£>), rule Holds(F' , S) 
will always override rule ( f49| ) and the fact Holds(F, S) cannot be derived from the 
new PLP obtained by adding rule Holds (F' , S) <— into V AT (T>). A similar exam- 
ple can be given for a domain description of AT 2 as well. Therefore, in general, 
domain descriptions of AT 1 and AT 2 are not O-monotonic under the condition of 
Theorem 7. 

6.4 Temporal definiteness 

Besides O-monotonicity, there is also a class of action domains that satisfies a so- 
called temporal definiteness property in temporal reasoning. Consider a domain 
description T>. We say that T> is temporally definite if for any value proposition of 
the form ([!]) , V \=at' L after A implies V ^at' L after A' or T> \=xr< L after A 1 
(i = 0, 1, 2), where A is a substring of A', i.e. A' — A- A\ ■ ■ ■ A^. Intuitively, temporal 
definiteness expresses a kind of definite information on fluents' truth values with 
respect to actions. For instance, if the switch is on initially, then we would expect 
that no matter what actions are executed afterward, the switch should be either on 
or off. It would be undesirable if after executing some actions, the status of switch 
becomes unknown. 

As only deterministic actions are considered in our context, the temporal defi- 
niteness seems a reasonable requirement for our temporal reasoning. It is easy to 
verify that domain descriptions T '(Switch- Power) and T>(Switch- Power') described 
in section 3 are temporally definite. However, as defeasible information is allowed 
in domain descriptions, this property does not always hold. 

Example 10 

Consider a scenario where there are constraints: (1) birds normally can fly; (2) a 
wounded bird normally cannot fly. Suppose we initially know that a specific bird 
Tweety is not wounded. Then after being shot, Tweety is wounded. What we are 
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interested in is whether Twecty can fly after she is shot. We name this scenario 
Shooting-3 which can be described by our action language AT . Let T> (Shooting-3) 
be a domain description of AT including the following propositions: 

initially -^Wounded, 
Fly is caused if with absence -'Fly, 
-•Fly is caused if Wounded with absence Fly, 
Shoot causes Wounded. 

Now we translate V {Shooting-?)) into the corresponding PLP V AT ° (Shooting- 
3) = (n° spec U n° nd , J\f, <), where LT° pec consists of the following rules^ 
Initial fact rule : 
Ni : -> Holds (Wounded, Sq) «— , 
Causal rules : 

N2 : Caused + (Fly,s) <— not -1 Holds (Fly, s), 

N3 : Caused^ (Fly , s) <— Holds (Wounded, s), not Holds(Fly, s), 

Action effect rule : 

-/V4 : Effect + (Wounded, Result(Shoot, s)) <— . 

Then it is easy to see V Ar (Shoot-3) \= Holds (Fly, Sq) (e.g. Tweety can fly 
initially). Furthermore, it is also not difficult to conclude that there exist two answer 
sets for V AT (Shoot-3) such that Holds(Fly, Result(Shoot, Sq)) is in one answer set 
and ->Holds(Fly, Result(Shoot, Sq)) is in another. So we have 

V (Shooting-3) \=at° initially Fly, 
T>(Shooting-3) Y=at° Fly after Shoot, 
T>(Shooting-3) Y=AT a ^Fly after Shoot. 

So T> (Shooting-3) is not temporally definite. But intuitively, we would prefer that 
Tweety cannot fly after being shot because causal rule A3 seems to be more specific 
than A^. Solving this problem involves the issue of representing domain-dependent 
preference which will be discussed in section 6.5. 

Lemma 1 

A domain description V of AT 1 (i = 0,1,2) is temporally definite if its PLP 
translation p- 47 " (T>) has a unique answer set. 

The converse of Lemma 1, however, does not hold. That is, for a temporally 
definite domain description, its PLP translation may have more than one answer 
set. For instance, in domain description T>(Shooting-3) described above, if we ini- 
tially know that Tweety is already wounded, then the modified domain description 
becomes temporally definite but its PLP translation will still have more than one 
answer sets, i.e. one answer set includes Holds(Fly, Sq) while the other includes 
Holds (Fly, S Q ). 

Lemma 1 actually presents a sufficient condition to ensure a domain description 
to be temporally definite. Observing Example 10, we can see thatP- 47 " (Shooting-3) 
has more than one answer set because two causal rules N2 and N3 conflict with each 



For simplicity, here we omit the explicit description of naming function JV and <-relations. 
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other on fluent Fly's truth value in situation Result(Shoot, Sq), while Fly's truth 
value is initially true, i.e. P Ar (Shooting-3) |= Holds (Fly, So)- This observation 
motivates our examination on the structure of an action domain. 

Consider an extended logic program II. Using a procedure proposed by Gelfond 
and Lifschitz (see Appendix A), we can actually transform II into a general logic 
program^, denoted by Trans (II). It has been showed that a sufficient condition to 
ensure that Trans (II) has a unique stable model (or answer set under the context 
of extended logic program) is that Trans (II) is locally stratified. That means, there 
does not exist any potential conflict among any rules in Trans (II) (see Appendix A 
for a technical description on local stratification). Therefore, we have the following 
result. 

Theorem 8 

A domain description V of AT 1 (i — 0,1,2) is temporally definite if its PLP 
translation V Ar (V) has a unique reduct A 1 and Trans (A 1 ) is locally stratified. 

Theorem 8 implies that to guarantee a domain description T> to be temporally 
definite, no conflict should occur among the same type of defeasible rules after 
reducing V Ar (T>) to its reduct A 1 . In Example 10, since two causal rules N2 and 
N3 contain a potential conflict with each other, it causes TJ(Shooting-S) to be not 
temporally definite. However, conflicts between different types of defeasible rules 
will not affect the temporal definiteness for a domain description because such 
a conflict can be resolved during the process of generating a reduct of the PLP 
translation of the domain description. 

6.5 Indefiniteness and domain- dependent preferences 

As we mentioned before, temporal definiteness is a desirable property in temporal 
reasoning. However, it is also the fact that sometimes a domain description which 
is not temporally definite may still present right results from our intuition. For 
instance, in the domain of Switch-Power presented in section 3, if we add one more 
causal proposition into T>(Switch- Power): 

-1 On is caused if with absence Power. 

which says that if there is no explicit information stating that there is power, then 
it is assumed that the light is not on, the circumstance will then change. Suppose 
that initially we know that the light is not on, the switch is off, and there is no any 
information about if there is power. Then after turning on the switch, we would 
like to know whether the light is on. It is not difficult to show that the modified 
domain description, say T>(Switch- Power"), is not temporally definite. Specifically, 
we have 

T)(Switch-Power") \=^t° initially ->On, 
V(Switch-Power") \^j,to On after Turn- On, 
V(Switch-Power") Y^at ^On after Turn-On. 

16 A general logic program is a set of rules of the form A <— B\, — , B m , notB m +i, ■ ■ - , notB n , 
where A, B\, — , B n are atoms. Also see Appendix A. 
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Although action Turn- On is deterministic (see its effect proposition in Example 
4 in section 3.2.2), the above indefinite result seems reasonable from our intuition 
because without having definite information about power, it is impossible to decide 
whether the light is on after performing action Turn- On due to a conflict between 
two causal propositions in T>{Switch-Power"). 

This example reveals that although temporal definiteness sometimes indeed de- 
scribes a desired property, it should not become a particular restriction on action 
domains. So far, in our domain descriptions, preferences are used as built-in mecha- 
nisms of their PLP translations to handle conflicts among different types of propo- 
sitions. It is observed that domain-dependent preferences also play important roles 
in temporal reasoning. For instance, in some domains, it is the case that within 
the same type of defeasible propositions, one proposition is more preferred than the 
other. Consider Example 10 presented in section 6.4 once again. We have mentioned 
that two causal propositions 

Fly is caused if with absence -<Fly, 

-i Fly is caused if Wounded with absence Fly, 

contain a conflict under the circumstance by knowing that Tweety is wounded. This 
conflict leads T>{Shooting-?i) to be temporally indefinite. But from our intuition, the 
second causal proposition seems to represent more specific information than the first 
causal proposition. Therefore, during the temporal reasoning, once conflict occurs 
between these two causal rules, we would prefer the second causal proposition to 
defeat the first one (e.g. the wounded bird Tweety normally cannot fly if we do not 
know she can fly). 

This problem may be handled by including domain-dependent preferences on 
causal and observation propositions into the corresponding PLP translations of 
domain descriptions. For instance, in Example 10, we may add preference N3 < N2 
into V AT {Shooting-?)), and then V AT {Shooting-?,) becomes temporally definite 
and the fact ~^Fly after Shoot is entailed from the modified domain description. 

In general, to represent domain-dependent preferences in a domain description, 
we need to extend the language so that preference between two propositions can be 
explicitly expressed. One way of doing this is to introduce labels in the language and 
each proposition in the domain description is assigned a unique label. A preference 
proposition can be proposed as follows: 

li is more preferred than 1%, (51) 

where l\ and I2 are labels for causal or observation propositions in the domain 
description. Then we define the PLP translation of the extended domain description 
as (11, Af, < U <c U <o) (i — 0, 1, 2), where II, M and < are the same as before, and 
< c and < o are the preference orderings on causal and observation rules respectively 



that correspond to the specified preference propositions of the form (51) in the 
domain description. 
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7 Related Work 

In this section, we discuss some related work. In the research of reasoning about 
action, it is difficult to evaluate various action theories from a systematic standard 



though some studies on this topic have been developed, e.g. ( Sandewall, 1994 ). 
To compare with competing approaches, people usually have to demonstrate their 
methods with a small number of typical examples. It is still not clear yet what 
should be the unified standard for an action theory to satisfy. We feel that it would 
be rather weak to compare our approach with other action theories just through 
a small number of examples. As defeasibility handling is the central issue in our 
action formulation proposed in this paper, we will focus on this point as a major 
criterion to compare our approach with other methods. 

An early effort on handling defeasible causal rules in reasoning about action was 



due to the author's previous work (Zhang, 1999), in which the author identified 



the restriction of McCain and Turner's causal theory of actions (McCain & Turner, 



1995) and claimed that in general a causal rule should be treated as a defeasible rule 



in order to solve the ramification problem properly. In (Zhang, 1999), constraints 
([!]) and (^J) simply correspond to defaults Switch : On/ On and ->Power : /->On re- 
spectively. By combining Reiter's default theory ( Reiter, 1980D and Winslett's PMA 



(Winslett, 1988) the author developed a causality-based minimal change principle 
for reasoning about action and change which subsumes McCain and Turner's causal 
theory 



Although the work presented in (Zhang, 199£) provided a natural way to rep 



resent causality in reasoning about action, there were several restrictions in this 
action theory. First, due to technical restrictions, only normal defaults or defaults 
without justifications are the suitable forms to represent causal rules in problem do- 
mains. Second, this action theory did not handle the other two major defeasibilitics 
- defeasible observations and actions with defeasible and abnormal effects. 

Probably Jablonowski, Lukaszewicz and Madalihska-Bugaj's work ( [lablonowski 



ct aL 1996 ) was one of the early efforts on handling the problem of defeasible 
observations and actions with abnormal effects. Following Dijkstra's semantics on 
programming languages ( W. Lukaszewcz fc Madalihsks-Bugaj , 1995| ) , they proposed 



an action theory in which both defeasible observations and actions with abnormal 
effects were expressible. Their work actually presented a few new features. For in- 
stance, by employing Dijkstra's semantics in action theory, their method reduced 
the computational cost in action reasoning; it also dealt with both temporal predic- 
tion and postdiction reasoning while incomplete information is allowable in problem 
domains. 

However, the major limitation of this approach is that it did not solve the rami- 
fication problem properly. To deal with domain constraints in action scenarios, the 
action theory has to be extended by adding statements like 

A; release(Fi); ■ ■ ■ ; release(F n ) , 
which means that fluents F i , • ■ • , F „ involved in domain constraints may not obey 



the inertia rule with respect to the performance of action A (W. Lukaszewcz & 



Madalihsks-Bugaj, 1995| ). For example, if we combine a constraint like "the fact 
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that the turkey is not alive implies that the turkey is not walking into the previous 
shooting scenario, in order to derive an indirect effect -i Walk of action Shoot, a 
statement like Shoot; release(Walk) has to be added into the action theory. But to 
specify such statements, we have to know how each action exactly affects fluents 
involved in the domain constraint. Obviously for a complex problem domain this 
usually is not practicable without taking causality into account. Not surprisingly, 
due to such restriction, this approach is also hard to be extended to handle defeasible 
constraints in reasoning about action. 

Baral and Lobo recently also proposed an action formulation to address the issue 



of defeasible constraints and actions with defeasible effects (Baral fc Lobo, 1997) 



Following a similar spirit of Gelfond and Lifschitz's action language A ( Gclfond| 



fc Lifschitz, 1993] ), Baral and Lobo proposed an action language named ADC to 
describe action domains in which both defeasible constraints and actions with de- 
feasible effects are admitted. In their language ADC a defeasible constraint like ([[]) 
is represented as 

Switch normally suffices for On 

and the defeasible Shoot action illustrated in Example 5 is represented as 
Shoot normally causes ->Alive if Loaded. 



As showed in (Baral & Lobo, 1997), ADC has a simple syntax. Based on an ex- 
tended logic program translation, a transition system is defined to provide a formal 
semantics of ADC. 

It is worth to mention that our idea of defining semantics for AT , AT 1 and 
AT 2 is similar to Baral and Lobo's proposal for ADC. Both of these two approaches 
directly use logic programs to define a transition system for the action language, 
instead of developing a separate semantics like A language. Also, both approaches 
define states in a different way from the standard A language, that is, instead of 
defining a state to be a truth value assignment on fluents, these two approaches 
define a state to be a collection of fluent expressions so that incomplete information 
about fluents becomes allowable. 

Nevertheless, some restrictions exist in action language ADC: it can only reason 
about forward, i.e. temporal prediction, and observations on intermediate situations 
and final situation are not expressible. Therefore, their approach cannot deal with 
temporal postdiction. On the other hand, although actions with defeasible effects 
are allowed in the domain description, it seems that the issue of solving conflicts 
between defeasible action effect propositions and defeasible constraints was not 
addressed in detail. 

Finally, we briefly mention Geffner's recent work on causal theory of action 



( peffner, 1997 ) which is closely related to models of causal reasoning based on 



Bayesian networks and structural equation models (Goldszmidt & Pearl, 1992). 
To provide a well-founded solution to the ramification problem, Geffner claimed 
that causal rules of the domain should be defeasible in general. Although with a 
very different language and methodology, Geffner's system actually addressed the 



same problem discussed in (Zhang, 1999) and (Baral & Lobo, 1997). However, from 
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the viewpoint of defeasibility handling, this system is restricted because defeasible 
observations and actions with defeasible and abnormal effects were not considered. 



8 Conclusions 

We have developed a unified action formulation to handle three types of defeasibil- 
ities in reasoning about action. Our formulation consists of three action languages 
named AT , AT 1 and AT 2 respectively. We have showed that our action formu- 
lation is applicable to both temporal prediction and postdiction with incomplete 
information while defeasible constraints, defeasible observations and actions with 
defeasible and abnormal effects are admitted. As discussed in the previous section, 
although the issue of defeasibility in reasoning about action has been addressed 
by some researchers recently, our work presented here is the first effort to handle 
various defeasible information in temporal reasoning by using a prioritized logic 
programming approach. It enhances the viewpoint that the logic programming lan- 
guages can be employed as efficient low level formal languages for reasoning about 
action. 



Besides the author's work ( Zhang fc Foo, 1997a ), different prioritized logic pro- 



gramming formalisms have been proposed recently, e.g. (Brewka, 1996; Brewka 



Eiter, 1999 ; Grosof, 1997 ). The reason why we choose our PLPs to develop our 
action formulation is as follows. First we think that the answer set semantics for 
PLPs provides an intuitive and natural interpretation for conflict resolution in logic 
programs, and hence it is easy to use not only in reasoning about action, but also 



in other aspects of modeling system dynamics (Zhang & Foo, 19971 ; Zhang & Foo, 



1998). Second, a prepositional prioritized logic programming system (PLPS) has 



been implemented recently by the author and his students (Y et al, 2001). We 
believe that our PLPS can finally provide a practical programming language proto- 
type for representing actions with the capability of the defeasibility handling within 
the framework we proposed in this paper. 

The computational issue of prioritized logic programs has been addressed in the 
author's another work ( Zhang, 2001] ) . Briefly, the author has proved that for a 



propositional prioritized logic program, deciding whether it has an answer set is 
NP-complete, and deciding whether a given ground literal is entailed from this 
prioritized logic program is n^-complete. 

It is also easy to observe that since a rule containing variables in a PLP is viewed 
as a set of ground instances of this rule by replacing variables with all possible 
constants occurring in the PLP, under the case that a PLP does not have function 
symbols, the number of defeated rules eliminated from this PLP as described in 
Definition 2 is always finite. Hence, we can always compute a finite reduct of such 
PLP0. 

In the case that there are function symbols occurring in a PLP, the situation is 
different. Basically, the set of ground instances of a rule, that includes variables and 

17 Note that Theorem 1 shows that every PLP has a reduct, but such a reduct may contain infinite 
rules. 
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function symbols, may be infinite and therefore it might be possible that there are 
infinite number of defeated rules which should be eliminated from the original PLP. 
Under this situation, a reduct containing infinite rules may be produced according to 
Definition 2. From a practical viewpoint, we are only able to deal with finite reducts. 
To overcome this problem, we can set a proper restriction on the variable substi- 
tution. For instance, in the modified Switch-Power domain discussed in Example 4 
(see section 3.2.2), if all we are interested is to know what are the effect after actions 
Cut-Power and Turn-On are executed, then in the computation of the answer set of 
■pAT ^g w it c }i-p ower '^ we only need to consider situations So, Result ( Turn-On, So), 
Result(Cut-Power , So), and Result{Turn-On, Result(Cut-Power , So))- This implies 
that the ground form of PLP V Ar (Switch-Power') only has finite rules and hence 
it always has a finite reduct. 

Finally, we should mention that currently our action formulation cannot repre- 
sent nondeterministic actions and disjunctive domain information. That is, we only 
consider deterministic problem domains in this paper. This is due to the limit of 
prioritized logic programs inherited from extended logic programs. But we would 
argue that our prioritized logic programs are extendedable to represent disjunctive 
information by using a similar method described in ( |Gelfond fc Lifschitz, 1991 ) 
for extended logic programs, and our action formulation can then be extended to 
represent nondeterministic actions. 
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Appendix A: General Logic Programs and Stratification 

A general logic program is a finite set of rules of the form 

A i- Bi, ■ ■ ■ , B m , not B m+ i,---,not B n , (52) 

where A,Bx, ■ ■ ■ , B m , ■ ■ ■ , B n are atoms. 

Gelfond and Lifschitz developed a transformation to reduce an extended logic 



program to a general logic program (Gelfond & Lifschitz, 1991). Consider an ex- 



tended logic program II. For any predicate P occurring in II, let P' be a new 
predicate of the same arity. The atom P'(x) is called the positive form of the neg- 
ative literal ->P(x). Every positive literal is, by definition, its own positive form. 
The positive form of a literal L will be denoted by L + . II + stands for the general 
program obtained from II by replacing each rule Lq <— Lx, ■ ■ • , L m , not L m+ i, ■ ■ -, 
not L n in II by rule 

L% <- Lf, ■ ■ ■ , L+ n , notL+ n+ll notL+- 
Proposition 2 

( pelfond fc Lifschitz, 1991 ) A consistent set S C Lit is an answer set of II iff S + is 



an answer set of II + . 



Definition 16 



(Local stratification (Apt fc Bol, 1994)) 



Let LT be a general logic program. 

• A iocaJ stratification for LT is a function tp from the Herbrand base of II, Bn, 
to the countable ordinals. 

• Given a local stratification ip, we extend it to ground negative literals^] by 
setting ip(not A) = ip(A) + 1. 

• A rule with form ([52]) of II is called locally stratified with respect to a local 
stratification ip if for every ground instance of (&S 



A' <- B[, ■ ■■,B' m , not B' m+1 , • ... no* B' n , 

ip(A') > ip(B-), where 1 < i < m, and 
•tp{A') > ip(notBj), where m + 1 < i < n. 

• LT is called locally stratified with respect to a local stratification ip if all its 
rules are. LT is called locally stratified if it is locally stratified with respect to 
some local stratification. 

Proposition 3 

dGclfond k Lifschitz, 1988)) If a general logic program LT is locally stratified, then by 
treating LT as an extended logic program where each rule does not contain classical 
negation, it has a unique answer set. 



Note that here we mean weak negation not. 
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Appendix B: Proofs 
Theorem 1 Every PLP has a reduct. 

To proof Theorem 1, we need to introduce the concept of <-partition for a PLP. 
Definition 17 

Let V = (LT, TV" <) be an arbitrary PLP. A <-partition of II in V is a finite collection 
{IIi, • ■ ■ ,IIfc}, where II = IL U • • • U IL and Hi and IL, are disjoint for any i ^ j, 
such that 

1. Af(r) < Af(r') E V(<) implies that there exist some i and j (1 < i < j) such 
that r' E Hj and r £ Hi; 

2. for each rule r' E Hj (j > 1), there exists some rule r e II j (1 < i < j) such 
that 

M{r) < AT{r') G P(<). 
Example 11 

Consider a PLP "P 3 = (II, AT, <): 

JVi : A <— not S, not C, 

N 2 : B <- not -.C, 

iV 3 : C <- not A, not -.C, 

iV 4 : -.C <- not C, 

iVi < ^2,^2 < AT 4 , N 3 < N 4 . 

It is easy to verify that a <-partition of LT in Vz is {ni,n 2 ,Il3}, where 

n i: 

Ni : A <- not B, not (7, 
AT 3 : (7 <— not A, not -iC, 

n 2 : 

AT 2 : _B <- not -.C, 

n 3 : 

W 4 : nC <- not C*. 
In fact, this program has a unique answer set {B, C}. 

Lemma 2 

Every prioritized logic program has a <-partition. 
Proof 

For a given PLP V = (II, A/", <), we construct a series of subsets of IT as follows: 
LTi = {r there does not exist a rule r' E LT such that Af(r') < Af(r)}; 
U t = {r\ for all rules such that Af(r') < Af(r), r' E \J^l\ Uj}. 
We prove that {LTi, LT2, • • •} is a <-partition of V. First, it is easy to see that 
IIi and LTj are disjoint. Now we show that this partition satisfies Conditions 1 
and 2 described in Definition 17. Let 7V(r) < Af(r') E V(<). If there does not 
exist any rule r" E LT such that Af(r") < Af(r), then r £ H\. Otherwise, there 
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exists some i (1 < i) such that r G IF and for all rules satisfying Af(r") < Af(r) 
r" e III U • • • U n 2 _i. Let r' G IF. Since 7V(r) < TV(r'), it follows that 1 < j. From 
the construction of IF, we also conclude r G IIi U • ■ • U _ i . Since r' G IF, it 
follows i < j — 1. That is, i < j. Condition 2 directly follows from the construction 
of the partition described above. 

Now we show that {IF,n 2 , • • •} must be a finite set. Firstly, if II is finite, it is 
clear {ITi, n 2 , • • •} must be a finite set. If II contains infinite rules, then according to 
our assumption presented in Section 2, V must be the ground instantiation of some 
program, say V* = (II*, TV*, <*) wherell* is finite. Then we can use the same way to 
define a <-partition for V* . Since II* is finite, the partition of V* must be also finite: 
{III, n|, • • • , II*,}. As V* is well formed, it implies that for each i (i = 1, 2, • ■ •), IF is 
the ground instantiation of II*. So {IF, II 2 , ■ • •} = {IF, n 2 , • • • , IF} which is finite. 

□ 

Proof 

(Proof of Theorem 1) Fct V = (II, A/", <). From Femma 2, we can assume II has a 
partition II = IF U • • • U IF. We will show that V has a fixpoint in the process of 
reduction according to Definition 2. As IF, ■ • • , IF are disjoint and for any M{r) < 
Af(r'), it implies r G IF and r' G IF, where i < j, we can use notation 

IF < IF < • ■ < IF 

to illustrate this property. It is easy to see that for each rule in IF (1 < i < k), 
there must exist some j and h that j < i < h such that JV(r') < J\f(r) < Af(r") and 
r' G IF, , r" G IF . Now we construct a sequence of reductions that starts from those 
least preferred rules in IF, then from rules in IF_i U IF, and so on as illustrated 
below: 

n(°) = n = n!u---un fc; 

11^ — n' - 1 — {n, V2, ■ ■ ■ | n, ra, • • • G IF and n, ri, ■ ■ ■ satisfy the conditions 

as stated in Definition 2}; 
n (2) = n (i) _ r ru r 2 , • ■ ■ | n, r 2 , • • ■ G n 4 _i U Il k and n, r 2 , • ■ ■ satisfy 

the conditions as stated in Definition 2}; 
n (3) = n (2) _ | ri] r2i ... | ri] r2) . . . e n k - 2 U Ilk-! U Il k and n, r 2 , • ■ ■ satisfy 

the conditions as stated in Definition 2}; 

n (*-i) = n (*-2) _ | ri; r2j . . . | ri) r2j ■ ■ • £ n 2 U ■ • ■ U IF- and n, r 2 , • ■ ■ satisfy 

the conditions as stated in Definition 2}. 

It is observed that in the above reduction process, after obtaining IF fe_1 - ) , no 
more rules can be eliminated from n^ -1 ) by applying the conditions of Definition 
2 because after the zth reduction, all orderings inherited from rF_ i+1 < • • • < FF 
will no longer play any roles in the further (i + l)th, • • •, and (k — l)th reductions. In 
particular, in the ith reduction of obtaining IlW, all rules eliminated from IT^ -1 ) 
(note that there may be infinite number of rules to be eliminated in the ith reduc- 
tion) are due to some rules in IF U • • • U Hk-i which are more preferred than those 
eliminated rules in rF_ i+1 U • • • U IF. As k is a finite number, from Definition 2 
n^" 1 ) is also a reduct of V. □ 
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Proposition 1 Let V he a domain description of AT 1 and V AT ' (V) (i = 0, 1, 2) 
the corresponding PLP translation of V specified previously. V is consistent if and 
only if V Ar ' (D) has a consistent answer set. 

Proof 

Here we only prove the result for AT , proofs for other cases are similar. 
Suppose D has a model ^ . Then according to Definition 7, for any action string A 
such that ^f(A) is defined and any fluent F, F and ~^F cannot be both true in ^(A). 
From the definition of ^(A), i.e. Definition 6, it follows that ^(A'-A) G H(A, *(^4 7 )), 
where A = A' ■ A. Here we assume that A is not empty (otherwise, \I/(e) = So 
that we will consider next). Also since 'J's domain is prefix closed, ty(A') is also 
defined. Then from Dcfintiion 5 of transition function 1Z, it follows that 1Z(A, fy(A') 
contains a consistent set of fluent expressions. As this set is directly deduced from 
some answer set Ans of P AT (T>), it concludes that the subset of Ans consisting 
of all literals of the form Holds(F, S) or -^Holds(F, S) is consistent (note S ^ So). 
Now we consider the case of empty action string. In this case Vt'(e) = So- As is a 
model, So must be a consistent set. Again, as So is deduced from some answer set 
Ans of V Ar (£>), it concludes that the subset of Ans consisting all literals of the 
form Holds(F, So) or ^Holds(F, So) is consistent. Therefore, the subset of Ans of 
the following form is consistent: 

{[-i\Holds(F , So), ■ ■ ■} U • • ■ {[-]Holds(F, S), ■ ■ ■}. 

Recall that Ans also contains a subset that consists of atoms of the forms Effect + (F, S), 
Effect~(F,S), Caused + (F,S) and Caused~ (F , S) . Clearly, this subset of Ans is 
also consistent. So Ans is consistent. 

Now suppose Ans is a consistent answer set of V Ar (V). Then from Definitions 
5, 6, and 7, we can construct a model \& for V in an obvious way. □ 

Theorem 2 Every normal domain description of AT is consistent. 
Proof 

Let O be a normal domain description of AT . That is, V satisfies Conditions 
(i), (ii) and (iii) in Definition 12. According to Proposition 1, wc only need to 
show that the PLP translation V AT (V) of V has a consistent answer set. Let 
V AT °(V) = (n, Af, <). First, from Condition (ii) and the construction of V AT ° (V) , 
it is observed that n does not contain rules of the following forms: 

n: Li <— • • • , notL* , ■ ■ ■, 
ri: L2 <—•••, L\, ■■ ■, 

n+i: L* <—•■■, Lk, ■ ■ ■. 

This actually ensures that n has an answer set Ans. To show this, we assume that 
n does not have an answer set. Then there must exist some literal L* satisfying the 
condition: for any set S of ground literals (S can be empty) (a) if L* £ S, then L* is 
in the answer set of program U s (II s is obtained from II by doing Gelfond-Lifschitz 
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transformation on II in terms of S); and (b) if L* e S, then L* is not in the answer 
set of program II 5 . It is worth to mention that since U s does not contain rules 
including negation as failure sign, II 5 always has an answer set. From case (a), it 
is implied that II must contain a rule of the form: 

r'k+i- L* < . 

On the other hand, from case (b), it is easy to observe that all rules of the form 
r' k+1 cannot be triggered in IP due to L* e S. That is, some rule of the form 

n : ii <— • • • , notL* , ■ ■ ■ 

must be contained in II (we do not exclude the case that L\ = L*). This follows 
that rule r' k+1 actually has a form: 

r 'k+i - L* <—•••,£',■■ • 

such that the deletion of r\ from II will cause literal V not to be triggered and 
hence L* can not be derived from II 5 . Without loss of generality, we can assume 
that II contains a sequence of rules n, ■ • • , r k +i as described above. 

Now we consider Condition (i). From Condition (i), we know that II does not 
contain a pair of rules of the forms: 

Holds{F,S ) <-, 
-nHolds(F,S ) <-. 

This follows that a subset of Ans in which each liteal is associated with initial 
situation So- 

{[-.]&>Ms(Fi, fib), • • • , HHolds{F k ,S )} 

is consistent. Now we consider a pair of complementary propositions (I, I') in V. To 
simplify our presentation, for a rule of the form: 

r: Lo <— Li, ■ ■ ■ , L m , notL m+ \, ■ ■ ■ , notL n 

we denote pos(r) = {L\, - ■ ■ ,L m } and neg(r) — {L m+ \, ■ ■ ■ , L n }. Then r can be 
simply represented as Lq <— pos(r), neg(r). Under this notation, a pair of comple- 
mentary propositions I and I' in V may have one of the following possible transla- 
tions in II: 

(a) 

r: Caused + (F,s) <— pos(r), neg(r), 
r': Caused - (F,s) <— pos(r'),neg(r'), 
(b) 

r: Caused + (F,s) <— pos(r),neg(r), 

r': Effect' (F ', Result{A, s)) <- pos(r'),neg(r'), 

(c) 

r: Effect + (F ', Result(A, s)) <— pos(r), neg(r), 
r': Effect^ (F, Result (A, s)) <- pos(r'),neg(r'). 

From Condition (iii), we know that in each case of (a), (b) and (c), pos(r) and 
pos(r') cannot be both true in answer set Ans. Hence, for any situation term S, none 
of these three pairs of atoms Caused + (F,S) and Caused' (F , S) , Caused + (F,S) 
and Effect' (F,S), or Effect+(F,S) and Effect' (F,S) cannot both true in 4ns. 



Handling Defeasibilities in Action Domains 



45 



This concludes that Ans does not contain any complementary literals Holds(F, S) 
and ~^Holds(F , S) for any F and S. So Ans is a consistent answer set of II. Further- 



more, every answer set of II is also consistent (Lifschitz & Turner, 1994). Finally 



from the property that a PLP (II, TV, <) has an answer set iff II has an answer 



set and every answer set of (II, TV, <) is also an answer set of II (Zhang, 2001), it 
concludes that V AT (T>) has a consistent answer set (and its every answer set is 
also consistent). □ 

Theorem 3 Every normal domain description of AT 1 or AT 2 is consistent. 
Proof 

The proof is similar to the proof of Theorem 2 but with additional considerations 
on action explanation rules and action abnormal effect rules in P ATi (T>) . We omit 
it here. □ 

Theorem 4 Given a domain description V of AT and its PLP translation V AT (V) J 
the following results hold. 

(i) lfV AT \v) \= Holds(F,Result(A,S)) andV AT \v) ^ Holds(F,S), then 
V AT °{V) h Effect+{F,Result(A,S))orV AT °(V) \= Caused+ (F , Result [A, S)); 

(ii) IfV AT °(V) \=^Holds{F,Result(A,S)) and V AT ° (V) ^ ^Holds{F , S), then 
V Ar \v) f= Effect- (F,Result{A,S)) or V AT ° (V) \= Caused- (F, Result (A, S)). 

Proof 

It is sufficient to only prove (i). Since V AT \v) |= Holds{F , Result (A, S)) and 
V AT °{V) y= Holds(F,S), it follows that for each answer set Ans of T> AT ° (V), 
Holds(F , Result (A, S)) £ Ans, and there exists some answer set Ans' such that 
Holds(F, S) g Ans'. Therefore, the fact that Holds(F, Result(S , A)) is true is not 
due to inertia rules ( p~7| ) and ( p"8| ) in V Ar (V), but due to action effect rules ( [i"3| ) 
and (pj), or causal rules (§) and ©. That is, V AT " \= Effect+(F, Result(A, s)) or 
P AT ^L Caused+(F,Result{A,S)). □ 

Theorem 5 Let V be a domain description of AT 1 and V AT (V) be its PLP 
translation. Suppose each observation proposition in T> has the form 

L is observed if Li, • ■ ■ , L m with absence L, £ m +i, • ■ ■ ,L n after A, 

where A is not an empty string of actions. Then the following results hold. 

(i) lfV AT \v) ^^Holds{F,S) andV Ar \v) (= Holds (F, Result(A, S)), then 
V AT \V) (= Effect+{F,Result(A,S))orV AT \v) \= Caused+ {F , Result{A, S)); 

(ii) lfV AT \v) \= Holds(F,S) andV AT \v) (= ^Holds(F, Result(A, S)), then 
V AT \V) f= Effect- (F,Result(A,S))orT ATl {V) \= Caused' (F, Result(A, S)). 



Proof 

It is sufficient to only prove (i). As V AT ' (V) |= -^Holds(F,S) and V ATl (V) |= 
Holds(F, Result(A, S)), it is clear that the fact that Holds(F, Result(A, S)) is true 
in each answer set of V Ar (D) is not due to inertia rules (|l7]) , ( p^ ) , ( pq ) and ( p9| ) , 
but due to 
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(1) some observation rules in T Ari (T>), or 

(2) action effect rules (|H3)_and (|lj), or 

(3) causal rules (||) and (|l0|). 

Consider case (1). We suppose there exists some observation rule in V ATl (D) of 
the form 

Holds (F, Result ( A, S)) «- [-,]Holds(Fi, S), ■ ■ ■ , [->]Holds(F m , S), 

not [-i\Holds(F m+ i, S), ■ ■ ■ , not [-<]Holds(F n , S), 

But from the condition, we know that the above observation rule must be of the 
form: 

Holds{F , Result{A, S)) *- ■ ■ ■ ,not-^Holds(F, Result(A, 5)),- • • • 

This results in a conflict with inertia rule (|l8|): 

Holds (J , Result (a, s)) <— Holds(f , s), not -<Holds(f , Resulted, s)) 

in "P AT (T>) . As we specify inertia rules have higher priorities than observation rules 
in V AT (D), it turns out that ^Holds(F , Result(A, S)) is derived. So case (1) is im- 
possible. Hence, only cases (2) or (3) is possible to derive Holds(F , Result (A, s)). 
That is, V ATl {V) \= Effect+{F, Result(A, 5)), or V ATl (V) \= Caused+(F, Result(A, S)). 
□ 

Theoem 6 Let V be a domain description of AT 2 and V Ar (V) he its PLP 
translation. Suppose each observation proposition in T> has the form 

L is observed if L\, ■ ■ ■ , L m with absence L, L m +i, ■ ■ ■ ,L„ after A, 

where A is not an empty string of actions. Then the following results hold. 

(i) HV AT \V) h ~-Holds(F,S) andV AT \v) |= Holds(F, Result(A, S)), then 
one of following results holds: 

V AT \V) \= Effect+{F, Result{A, S)); 
V AT {V) |= AbE£fect+(F, Result(A, S)); or 
V AT (V) h Caused+(F,Result(A,S)); 

(ii) IfV AT (V) \= Holds(F,S) andV Ar \v) |= ^Holds{F, Result(A, S)), then 
one of following results holds: 

V AT \V) (= Effect- (F,Result(A,S)); 
V Ar (V) (= AbEffect~(F, Result(A, S)); or 
V AT {V) \= Caused' (F,Result(A,S)). 

Proof 

The proof of Theorem 6 is similar to that of Theorem 5 as described above. □ 

Theorem 7 Let T> be a domain description AT . V is O-monotonic if 

(i) each causal proposition in T> is of the form 
L is caused if Li, ■ • • , L m , and 

^Ltiat (^Effect ^Caused) = initial n (^Effect ^Kaused) = ®> and 
(•^Effect U ^Caused) 1 " 1 (-^Effect U -^Caused) = ®- 
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Proof 

Let V be an augment of V, V AT ° {V) and T AT °(V) be the PLP translations of V 
and V respectively. To prove the result, it is sufficient to prove that V Ar (T>) \= 
Holds(F, S) implies P AT ° (V) \= Holds(F, S). From the construction of V AT ° (V) , 
it is clear that V AT " (V) \= Holds(F, S) implies 

(1) V AT °(V)^Effect+(F,S), 

(2) V Ar \v) \= Caused+(F,S), or 

(3) V AT ° (V) \= Holds(F, S') due to inertia rules in V AT ° (V), where S = Result(A, S'). 

Adding more observation propositions into V to form V , the new program V AT \V) 
then may have the following effects: 

(a) initiating some action effect rules in V AT ° (T>); 

(b) initiating some casual rules in V Ar (V)\ 

(c) defeating some casual rules in V Ar (V); 

(d) not initiating any action effect and causal rules in V AT (2?). 

Firstly, since each causal proposition in T> has the form 
L is caused if Li , • • • , i m , 

this follows that each corresponding causal rule in V Ar ° (V) is non-defeasible, i.e. 
no negation as failure sign not is included in the body. Hence, the effect (c) will 
not be presented. On the other hand, since both causal rules and action effect rules 
are non-defeasible in V Ar (£>), it is clear that initiating more action effect rules 
or causal rules in AT will not affect the truth values of literals Effect + (F,S) 
and Caused+(F,S) if V AT ° (V) \= Effect+(F,S) and V AT \V) \= Caused+(F,S) 
respectively. 

Now suppose V Ar (V) \= Holds(F, S) is due to some inertia rule in V Ar (V): 
N : Holds(F, S) <- Holds(F, 5"), not^Holds{F ', S), 

where S = Result(A, S'), and V Ar \v) |= Holds(F,S'). We prove V Ar \V) |= 
Holds(F,S). 

Case 1 . Suppose S — So and V Ar (V) \= Holds(F, So). Since no inertia rule is 
needed to drive Holds(F, S ), the only possibility to have V Ar (V) Holds(F, S ) 

is either Holds(F,S ) «- is in V AT °(V'), or Caused* (F, s) < is in V AT " (V) 

(note that such causal rule is non-defeasible). Obviously, in V Ar (T>'), the truth 
value of Holds(F, So) will not be affected. Hence the result holds. 
Case 2 . Now consider the case that S is not the initial situation. Suppose V AT (V) \= 
Holds(F ', S). It implies that there exists some action constant A such that V Ar (V) \= 
Holds(F, S') due to the inertia rule in V AT °{V), where S = Result(A, S'). 
Now suppose V AT \W) ^ Holds(F, S). So the inertia rule: 

N' : Holds(F, S) <- Holds(F, S'), not-, H olds {F , S), 

where S = Result(A, S'), is defeated in V AT " (V). Hence it must be the case that 
F e Effect u F caused)- On the other hand, from the fact that that V AT< \v) |= 
Holds(F,S>), it follows that F £ T+ mtlal or F G (T+ ffect U T Caused ). But this 
contradicts conditions of Theorem 7. So it must have V Ar (V) \= Holds(F, S). 
□ 
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Lemma 1 A domain description V of language AT 1 (i = 0,1, 2) is temporally 
definite if its PLP translation V Ar ' (T>) has a unique answer set. 

Proof 

Let V Ar (V) be the PLP translation of V. From the definition of temporal defi- 
niteness, it is sufficient to prove that V AT \V) \= Holds{F, S) implies T AT ' (D) \= 
Holds(F, Result(A, S)) or P AT '(V) \= ^Holds(F, Result (A, S)) for any action con- 
stant A. Suppose V AT ' (V) |= Holds(F,S) and V AT ' (V) has a unique answer set 
Ans 1 . So Holds(F ', S) 6 Ans 1 . Then, it is clear that if one of the following cases 
holds, the result is true: 

(1) Effect+ [F , Result(A, S)) or Effect- (F , Result (A, S)) is in Ans 1 ; 

(2) AbEffect+(F, Result {A, S)) or AbEffect~(F, Result {A, S)) is in Ans\ here i = 

2; 

(3) Caused + (F , Result (A, S)) or Caused' (F ', Result (A, S)) is in Ans 1 ; 

(4) -nHolds(F,Result(A,S)) is in Ans 1 . 

Now suppose none of the above cases is held. Then from the instance of inertia rule 
in P AT '(V): 

Holds{F, Result(A, S)) <- Holds(F, S),not^Holds(F, Result(A, S)), 

it follows that Holds(F, Result(A, S)) is in Ans 1 . So the result is still true. □ 

Theorem 8 A domain description V of AT 1 (i = 0,1, 2) is temporally definite 
if its PLP translation V AT ' (D) has a unique reduct A 1 and Trans (A 1 ) is locally 
stratified. 

Proof 

The proof is directly from Lemma 1 and Proposition 3 in Appendix A. □ 
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